Working Groups/Linguistics
From Open Knowledge Foundation
Working Group on Open Data in Linguistics
Purpose
1. Promote the idea and definition of open data, as specified at opendefinition.org, in linguistics and in relation to language data.
2. Act as a central point of reference and support for those interested in open linguistic data.
3. Facilitate communication between researchers from different communities that use, distribute, or maintain open linguistic data.
4. Serve as a mediator between providers and users of technical infrastructure.
5. Build and maintain an index of open linguistic data sources and tools that link existing resources.
6. Assemble best-practice guidelines and use cases for creating, using, and distributing data.
7. Gather information on legal issues surrounding linguistic data and make it available to the community.
Blog
Meetings and Workshops
We usually meet at intervals of 6 to 8 weeks, either in person or via Skype. In addition to group meetings, we organize workshops.
Next:
- real-life meeting @ LREC 2012, Istanbul (between May 21st and 27th; see the doodle poll). We will try to arrange a Skype dial-in.
Previous Meetings
- 2012, Apr 30th: telco (wg/linguistics/minutes/20120430)
- 2012, Mar 9th: real-life meeting @ LDL 2012 (wg/linguistics/minutes/20120309)
- 2012, Mar 7th - 9th: workshop at Linked Data in Linguistics (LDL 2012), Frankfurt/M.
- 2012, Jan 25th: telco (wg/linguistics/minutes/20120115)
- 2011, Dec 14th: telco (wg/linguistics/minutes/20111214)
- 2011, Oct 24th: real-life meeting @ ISWC 2011, Bonn (minutes)
- 2011, Jun 30th: workshop at the OKCon 2011
- 2011, May 27th: telco (minutes)
- 2011, Jan 18th: real-life meeting in Berlin (minutes)
- 2010, Dec 1st: real-life meeting in Berlin
- 2010, Oct 26th: real-life meeting in Berlin
- 2010, Oct 19th: telco
Members
This list is incomplete; please add yourself.
- Armelle Boussidan, CNRS Lyon
- Christian Chiarcos, ISI/USC
- Sebastian Hellmann, Universität Leipzig
- Nancy Ide, Professor of Computer Science at Vassar College and Technical Director of the American National Corpus project
- Steven Moran, LMU Munich
- Sebastian Nordhoff, MPI for Evolutionary Anthropology
- Cornelius Puschmann, University of Düsseldorf
- Pablo Mendes, Freie Universität Berlin
- Zoltán Varjú, linguist advisor, Weblib LLC, Hungary
- Richard Littauer, University of Edinburgh
Possible Projects
- Collecting use cases and developing best-practice recommendations for making linguistic data open.
- Maintaining a registry of collections of open corpora, dictionaries, and other linguistic resources on CKAN.
- Developing a Linked Open Data (sub)cloud of linguistic resources (cf. the Linguistics Linked Open Data cloud page).
- Developing a workflow repository and platform (cf. the Workflows page).
Participate
- Open Linguistics Mailing List: http://lists.okfn.org/mailman/listinfo/open-linguistics
- Wiki page: http://wiki.okfn.org/wg/linguistics
- Etherpad: http://okfnpad.org/OWLG
Tasks
- Invite other prospective members:
  - Ask around for other people to invite
- Discuss WG purpose, projects and ideas to be pursued by the group
- Participate in the projects identified above
- Collect case studies and best-practice recommendations, e.g., on legal issues (licenses, copyright, etc.)
- Register data at CKAN and contribute to the Linguistics Linked Open Data cloud
Resources
Links
- Copyright issues of the PanLex project: http://utilika.org/info/panlex-ip.html