GSoC 2011

= Google Summer of Code 2011 at the Open Knowledge Foundation =

Below are some ideas for Open Knowledge Foundation projects which could be interesting to those applying to participate in Google Summer of Code 2011.

Many projects will require some knowledge of either Python or Javascript. Knowledge of any of the following is also an advantage (but by no means essential!): CSS, Pylons/Pyramid, PostgreSQL, SQLAlchemy, Genshi, Solr, AMQP, and RDF.

CKAN - the Data Hub

 * Summary: The easy way to get, use and share data
 * Project URL: http://ckan.org/ + site at http://ckan.net/
 * Code: https://bitbucket.org/okfn/ckan/src
 * Mailing list: ckan-discuss

Tasks

 * Improve datapkg, our command-line apt-get for data. While this can already down- and upload data, it needs to become more easily usable to turn it into the swiss army knife of anyone working with Open Data.

Ideas

 * Automated validation of datasets
 * Automatically
 * Integration with Google Refine or a similar data cleaning tool
 * 'Dippets' - data snippets. Small code samples for working with a dataset
 * Improved integration between getthedata (Q&A for data and ckan)
 * E.g. Auto link GetTheData question to CKAN datasets and vice versa
 * 'Watch'
 * CKAN plugins

Open Spending / Where Does My Money Go?

 * Summary: Tracking every (public) government and corporate financial transaction across the world and present it in useful and engaging forms for everyone from a school-child to a data geek.
 * Project URL: http://openspending.org/ + http://wheredoesmymoneygo.org/
 * Code: http://bitbucket.org/okfn/wdmmg/
 * Mailing list: wdmmg-discuss + wdmmg-dev

Ideas

 * Merge interface for consolidating or linking different representations of same entity (e.g. same Company listed differently e.g. Google Inc, Google, Google Ltd)
 * New visualizations -- e.g. develop

Tasks

 * Create data journalism tools for investigators, allowing for easy drill-down and export of data.
 * Link OpenSpending to the Google Vis API to allow people to compose their own visualizations of spending.
 * Develop a tool to automatically link news stories to spending items.

Bibliographica

 * Summary: Wordpress for bibliographies. Easily create and share lists of publications.
 * Project URL: http://bibliographica.org/
 * Code: https://bitbucket.org/okfn/openbiblio/src
 * Mailing list: openbiblio-dev

Ideas

 * Lots of RDF / Semantic Web data acquisition
 * AJAX-based interface for adding relations between different books (e.g. this book refers to this book)

Annotator

 * Summary: Annotate any document on the web. 2 lines of javascript you can insert into any page, allowing you to select and annotate text, images, or (almost) anything else.
 * URL: http://annotateit.org/
 * Code: https://github.com/okfn/annotator
 * Mailing list: okfn-dev