PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Effects of Age and Gender on Blogging
Jonathan Schler, Moshe Koppel, Shlomo Argamon and James Pennebaker
In: AAAI Spring Symposium on Computational Approaches for Analyzing Weblogs, April 2006, Stanford, CA.


Analysis of a corpus of tens of thousands of blogs – incorporating close to 300 million words – indicates significant differences in writing style and content between male and female bloggers as well as among authors of different ages. Such differences can be exploited to determine an unknown author’s age and gender on the basis of a blog’s vocabulary.

EPrint Type: Conference or Workshop Item (Paper)
Project Keyword: UNSPECIFIED
Subjects: Natural Language Processing; Information Retrieval & Textual Information Access
ID Code: 2688
Deposited By: Jonathan Schler
Deposited On: 22 November 2006