Computational Stylistics Group
A cross-institutional research team focused on computer-assisted text analysis.
Computational Stylistics Group is a cross-institutional research team focused on computer-assisted text analysis, stylometry, authorship attribution, sentiment analysis, and the like stuff. The research projects conducted by the team members could be described as an intersection of linguistics, literary criticism, and computer science – however the best name here would be “Digital Humanities”. The group is based mostly in Kraków, at the Institute of Polish Language (Polish Academy of Sciences), but also at the Jagiellonian University and the University of Antwerp.
Even if the Group has been involved in several research projects (some of them are listed on this website, on the Projects subpage), it is probably known – at the first place – for the R package stylo, which is a comprehensive collection of functions written in the programming language R, for performing a variety of experiments in computational stylistics. More information about the package can be found here. Also, please check the discussion list dedicated to various issues in stylometry and beyond.
The Computational Stylistics Group is a member of the Federation of Stylometry Labs (FoSL), and closely collaborates with the COST Action “Distant Reading”, as well as the Digital Literary Stylistics Special Interest Group (SIG) affiliated with the Alliance of Digital Humanities Organizations. Being based mostly in Krakow, it is also a part of the DH Kraków initiative.
News
Mar 4, 2024 | The version 0.7.5 of the R package “stylo” released! Click here for further details. |
Jul 10, 2023 | The team is headed to the SIG-DLS workshop on computational stylistics; Ben about to deliver a keynote talk on meter that matters! Read more. |
Jan 13, 2023 | How distinctive are fictional characters in European drama? Check out our recent collaborative paper, available as arXiv preprint. |
Dec 18, 2022 | Check out Maciej’s idea to boost word frequencies; the paper was presented at CHR2022. |
Dec 10, 2022 | A paper on Piotrowski’s law and modeling the dynamics of language change; the following arXiv preprint leads you to full version for free. |
Sep 18, 2022 | Finally there! Among other things, we test whether lemmatization improves performance in stylometric tests; pay-walled version here: Stylistic fingerprints, POS-tags, and inflected languages: A case study in Polish, but here its a free arXiv preprint to download. |
Aug 9, 2022 |
Using the function stylo() in batch mode: a new blog post to be found here.
|
Jul 21, 2021 | Performance measures in supervised classification: a new blog post to be found here. |
Jun 24, 2021 | One word to rule them all: our talk on word embeddings in stylometry is on YouTube now! |
Jun 15, 2021 | Our paper on handwritten text recognition is out! Check out here. A pre-print version can be found here. Also, a talk about the project (in Polish) is available on YouTube. |