Metablake: Learning from a Visual Folksonomy: Automatically Annotating Images from Flickr

Monday, January 08, 2007

Learning from a Visual Folksonomy: Automatically Annotating Images from Flickr

Recently, a large visual dataset has emerged from a web-based photo service called Flickr which utilizes the organizational power of folksonomy to label a tremendous amount of visual data. Flickr users upload snapshots from their digital cameras to the web, and if marked as public, the community annotates these images with descriptive tags. Can this large collective labeling effort be used to train a computer to annotate images? What concepts are we able to train a computer to visually identify?

This project uses a simple crawler to download photos from Flickr labeled with a certain tag, and then extracts color and texture features from these images so that they can be used to train a classifier, such as a Support Vector Machine (SVM). By automating this process of downloading images, extracting features, training, and testing, we are able to apply our system to many different tags and see which tags correspond to identifiable visual features. We have found that the system performs relatively well annotating images with one label, selected from a small vocabulary, for images belonging to concepts with distinct color and texture features. (Full paper found here)

Spring 2006

Learning from a Visual Folksonomy: Automatically Annotating Images from Flickr

Visual Databases Project
Using SVMs to automatically label images from Flickr.

Fall 2005

Semidefinite Embedding: Applied to Visualizing Folksonomies

Adv. Machine Learning Project
Exploration of SDE and its applications to data mined from Del.icio.us.

CUtunes Project Report

Independent Project
End of the semester report for CUtunes.

Utilizing Folksonomy: Similarity Metadata from the Del.icio.us System

Web-enhanced Information Management
A Project which aggregates RSS feeds from del.icio.us and provides a browser which utilizes a novel kind of similarity metadata.

Building A Better Folksonomy

Web-enhanced Information Management
A paper talking about the ability of folksonomy to organize large datasets.

Spring 2005

Visualizing High-Dimensional Data

Machine Learning Project
Using Locally Linear Embedding to visualize CUtunes data.

Visualizing Clusters in Microarray Data

Computational Genomics
A tool utilizing LLE and NMF algorithms to effectively visualze clusters in gene expression data.

Fall 2004

Gesture Recognition

Computer Vision Project
A simple visual gesture recognition system written in Java for Mac OS X.

Summer 2004

Forest Fire Image Segmentation

Summer Job
Here is some work I did for identifying the borders of forest fires in aerial images.

Spring 2004

Parallel Searching Techniques

Parallel Computing
A package for parallel statespace searching.

Programming Languages and Translators
A small innovative language for grid-based visualization.

Summer 2003 and earlier

Summer Job
Summary of work I did at the plasma lab at Columbia.

Advanced Programming
A simple yet addictive game.