Title: Detecting the evolution phases of a text production
Authors: Madalina Olteanu - Pantheon-Sorbonne University (France)
Stephane Lamasse - Université Paris 1 (France) [presenting]
Abstract: The aim is to illustrate how one may identify the transformations of a text over time. We investigate the content of the Wikipedia pages of several famous researchers and historical figures in order to bring out their production phases. Temporal information on the content of a page, since its creation and with a high temporal resolution, may be available: the size of the page, the number of words, and, with more text-mining effort, the table of contents, ... We apply time-segmentation techniques (change-point detection) and semantic analysis for exploring the evolution of the pages and identifying key-events.