The 2nd Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literatureto be held in conjunction with COLING 2018 in Santa Fe, NM, USA
First Call for Papers (with apologies for cross-posting)
Organisers: Beatrice Alex, Stefania Degaetano-Ortlieb, Anna Feldman, Anna Kazantseva, Nils Reiter, Stan Szpakowicz
LaTeCH-CLfL 2018 is a second joint meeting of two communities with overlapping research goals and a similar research focus. The SIGHUM Workshops on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH) have been a forum for researchers who develop new technologies for improved information access to data from the broadly understood humanities and social sciences. The ACL Workshops on Computational Linguistics for Literature (CLfL) have focussed on applications of NLP to a wide variety of literary data. The first joint workshop (LaTeCH-CLfL 2017) brought together people from both communities. We count on this workshop to broaden the scope of our work even further, and to encourage new common research initiatives.
A highlight of the workshop will be Ted Underwood’s invited talk (https://tedunderwood.com/).
Topics and Content
In the Humanities, Social Sciences and Cultural Heritage communities, there is increasing interest in and demand for NLP methods for semantic annotation, intelligent linking, discovery, querying, cleaning and visualization of both primary and secondary data; this is even true of primarily non-textual collections, given that text is also the pervasive medium for metadata. Such applications pose new challenges for NLP research, such as noisy, non-standard textual or multi-modal input, historical languages, multilingual parts within one document, lack of digital resources, or resource-intensive approaches that call for (semi-)automatic processing tools and domain adaptation, or, as a last resort, intense manual effort (e.g., annotation).
Literary texts bring their own problems, because navigating this form of creative expression requires more than the typical information-seeking tools. Examples of advanced tasks include the study of literature of a certain period or sub-genre, recognition of certain literary devices, or quantitative analysis of poetry. More generally, there is a growing interest in computational models whose results can be interpreted in meaningful ways.
A common forum is mutually beneficial to NLP experts, data specialists, digital humanities researchers, and those who study literature. The second edition of the joint workshop has something for everyone in all such communities. We invite contributions on these, and closely related, topics:
• adapting NLP tools to Cultural Heritage, Social Sciences, and to the humanities including literature;
• fully- or semi-automatic creation of semantic resources;
• automatic error detection and cleaning of textual data;
• building and analyzing social networks of literary characters;
• complex annotation schemas, tools and interfaces;
• dealing with linguistic variation and non-standard or historical use of language;
• discourse and narrative analysis/modelling, notably in literature;
• emotion analysis for the humanities and for literature;
• generation of literary narrative, dialogue or poetry;
• identification and analysis of literary genres;
• linking and retrieving information from different sources, media, and domains;
• modelling dialogue literary style for generation;
• modelling of information and knowledge in the Humanities, Social Sciences, and Cultural Heritage;
• profiling and authorship attribution;
• research infrastructure and standardisation efforts in the Humanities, Social Sciences, and Cultural Heritage;
• searching for scientific and/or scholarly literature.
Information for Authors
We invite papers on original, unpublished work in the topic areas of the workshop. In addition to long papers, we will consider short papers and system descriptions (demos). We also welcome position papers.
* Long papers, presenting completed work, may consist of up to eight (8) pages of content, with two (2) additional pages of references.
* A short paper / demo can present work in progress, or the description of a system, and may consist of up to four (4) pages of content, with one (1) additional page of references.
* A position paper — clearly marked as such — should not exceed six (6) pages including references.
All submissions are to use the ACL stylesheets (either .sty and .bst or .dot). Papers should be submitted electronically, in PDF, via the LaTeCH-CLfL2018 submission website:
Reviewing will be double-blind. Please do not include the authors’ names and affiliations, or any references to Web sites, project names, acknowledgements and so on — anything that immediately reveals the authors’ identity. Self-references should be kept to a reasonable minimum, and anonymous citations cannot be used.
Accepted papers will be published in the workshop proceedings, and later available in the ACL Anthology.
Paper submission deadline: May 25, 2018
Notification of acceptance: June 20, 2018
Camera-ready papers due: June 30, 2018
Workshop date: August 20 or 21, 2017
More on the organisers
Beatrice Alex, School of Informatics, University of Edinburgh
Stefania Degaetano-Ortlieb, Language Science and Technology, Saarland University
Anna Feldman, Department of Linguistics & Department of Computer Science, Montclair State University
Anna Kazantseva, National Research Council of Canada
Nils Reiter, Institute for Natural Language Processing (IMS), Stuttgart University
Stan Szpakowicz, School of Electrical Engineering and Computer Science, University of Ottawa