hide
Free keywords:
-
Abstract:
Gallicagram is a lexicometry tool, based primarily on the archives of the French National Library and those of Le Monde newspaper. It counts the occurrences of a word and syntagma for a chosen corpus and a given period and offers several visualization options of the resulting data. For researchers, this software offers several assets: a large enough volume of data sufficient for lexicometric analysis from 1600 to present; transparency, which its competitor Ngram Viewer notably lacks; and a more constant structure throughout time. This article presents Gallibase, its extension which applies the tools of textual statistics, in particular factor analysis and tree clustering. It illustrates its potential and insists on the value of press corpora, which allows for the study of short periods.