
Released

Journal Article

Gallicagram : les archives de presse sous les rotatives de la statistique textuelle

MPS-Authors

De Courson, Benoît
Criminology, Max Planck Institute for the Study of Crime, Security and Law, Max Planck Society;

Fulltext (public)

corpus-7944.pdf
(Any fulltext), 2MB

Citation

De Courson, B., Azoulay, B., De Courson, C., Vanni, L., & Brunet, É. (2023). Gallicagram: les archives de presse sous les rotatives de la statistique textuelle. Corpus, 24. doi:10.4000/corpus.7944.


Cite as: https://hdl.handle.net/21.11116/0000-000C-938B-1
Abstract
Gallicagram is a lexicometry tool based primarily on the archives of the French National Library and those of the newspaper Le Monde. It counts the occurrences of a word or syntagma in a chosen corpus over a given period and offers several options for visualizing the resulting data. For researchers, the software has several advantages: a volume of data large enough for lexicometric analysis from 1600 to the present; transparency, which its competitor Ngram Viewer notably lacks; and a more consistent structure over time. This article presents Gallibase, an extension that applies the tools of textual statistics, in particular factor analysis and tree clustering. It illustrates its potential and stresses the value of press corpora, which allow the study of short periods.
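
The kind of count described in the abstract (occurrences of a word or syntagma in a corpus over a period) can be illustrated with a minimal sketch. This is not Gallicagram's actual implementation: the function name `ngram_frequency_by_year`, the `(year, text)` input format, and the whitespace tokenization are all assumptions made for illustration.

```python
from collections import Counter


def ngram_frequency_by_year(documents, phrase):
    """Relative frequency of `phrase` per year in a toy corpus.

    `documents` is an iterable of (year, text) pairs (a hypothetical format).
    Tokenization is naive: lowercase, split on whitespace.
    """
    target = tuple(phrase.lower().split())
    n = len(target)
    hits = Counter()    # occurrences of the phrase, per year
    totals = Counter()  # all n-token windows seen, per year

    for year, text in documents:
        tokens = text.lower().split()
        # Every contiguous window of n tokens in this document.
        windows = [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
        totals[year] += len(windows)
        hits[year] += sum(1 for window in windows if window == target)

    # Skip years with no windows to avoid dividing by zero.
    return {
        year: hits[year] / totals[year]
        for year in sorted(totals)
        if totals[year] > 0
    }
```

Dividing by the number of n-token windows in each year, rather than reporting raw counts, keeps years comparable when the amount of digitized text varies over time.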