Your dream corpus is already there
In: 7th International Conference on Language Resources and Evaluation (LREC-2010), 19-21 May 2010, Valletta, Malta.
The tutorial will give attendees an in-depth look at how to tap
the potential of web data as a source of linguistic resources that go beyond
frequency counts. Special focus will be given to linguistic resources that can
be compiled from web data that was created for entirely different purposes.
The tutorial will combine theoretical analysis with a discussion of concrete
examples, the problems that had to be overcome, and the solutions provided.