a collection of projects and ideas
Machine Learning
Research Interests
visualization, dimensionality reduction, spatiotemporal data, spectral optimizations, semidefinite programming, data mining, graph algorithms, large datasets
Papers
Structure Preserving Embedding
Blake Shaw, Tony Jebara
International Conference on Machine Learning, ICML, June 2009.
Best Paper Award Winner
Download: Paper (PDF) | Poster (PDF) | Slides (PDF) | BibTex
View Talk: videolectures.net | MP4
Structure Preserving Embedding (SPE) is an algorithm for embedding graphs in Euclidean space such that the embedding is low-dimensional and preserves the global topological properties of the input graph. Topology is preserved if a connectivity algorithm, such as k-nearest neighbors, can easily recover the edges of the input graph from only the coordinates of the nodes after embedding.
Dimensionality Reduction, Clustering, and PlaceRank Applied to Spatiotemporal Flow Data
Blake Shaw, Tony Jebara
New York Academy of Science - Machine Learning Symposium 2009.
Download: Paper (PDF) | Poster (PDF)
Visualizing Graphs with Structure Preserving Embedding
Blake Shaw, Tony Jebara
Analyzing Graphs: Theory and Applications. NIPS Workshop. December 2008.
Download: Paper (PDF)
Graph Embedding with Global Structure Preserving Constraints
Blake Shaw, Tony Jebara
New York Academy of Science - Machine Learning Symposium, October 2008.
Download: Paper (PDF) | Poster (PDF)
Minimum Volume Embedding (NYAS)
New York Academy of Science - Machine Learning Symposium 2007.
Download: Paper (PDF)
Minimum Volume Embedding
Blake Shaw, Tony Jebara
Artificial Intelligence and Statistics, AISTATS, March 2007.
Download: Paper (PDF) | Poster | BibTex | Code
Minimum Volume Embedding (MVE) is an algorithm for non-linear dimensionality reduction that uses semidefinite programming (SDP) and matrix factorization to find a low-dimensional embedding that preserves local distances between points while representing the dataset in many fewer dimensions.
Optimizing Eigengaps and Spectral Functions using Iterated SDP
Tony Jebara, Blake Shaw, Andrew Howard -- Learning Workshop 2007.
Download: Paper (PDF)
B-matching for Embedding
Tony Jebara, Blake Shaw, Vlad Schogolev
Snowbird Machine Learning Conference, April 2006.
Download: Paper (PDF)
Teaching
Programming Languages (Matlab)
w3101 section 1 - Spring 2008
Earlier Projects in Computer Science at Columbia
Projects at Sense Networks
CitySense - Live San Francisco Nightlife Activity
MacroSense - Relevant Recommendation, Personalization and Discovery from Mobile Location Data
CabSense - The Smartest Way to Hail a Cab in NY
Patents
12/134,634 (pending) - System and Method of Performing Location Analytics
12/241,266 (pending) - Event Identification in Sensor Analytics
2/241,227 (pending) - Comparing Spatial-Temporal Trails in Location Analytics