Lectures and Workshops – Abstracts – Summer school digital humanities

Monday June 8^th

Michaela Mahlberg (FAU Erlangen-Nürnberg & University of Birmingham)
Lecture: The fact and fiction continuum in the age of AI
The spread and scale of AI generated texts open up a range of questions about the role of language in society. Some of these questions are clearly not new, but the AI boom has given them a new urgency. In this lecture, my focus will be on how we make sense of the world through language – and specifically through stories. Storytelling combines conventional language patterns with the creative use of language. It is this combination that is typically drawn on in the discussion of ‘fictional’ vs ‘non-fictional’ texts. I will provide examples from case studies that look at linguistic features for the distinction of different registers as well as features of fictional world creation to discuss the alignment of AI generated text and human-produced texts. I will argue that a ‘fact and fiction continuum’ provides a useful basis for the evaluation of AI generated text.

Workshop: The fact and fiction continuum in the age of AI
In the workshop, we will build on the theoretical claims from the lecture and discuss them on the basis of text examples (human and AI-generated).

Suggested pre-reading:
(1) Roland, E., & So, R. J. (2026). Generative AI & Fictionality: How Novels Power Large Language Models (arXiv:2603.01220). arXiv. https://doi.org/10.48550/arXiv.2603.01220

Tuesday June 9^th – Morning Session

Lynne Bowker (Université Laval)
Lecture: Exploring the potential and pitfalls of AI for multilingual scholarly publishing
Since the mid twentieth century, English has become embedded as the central language for scholarly communication, creating inequities for non-Anglophone scholars, as well as for science and society more broadly. AI tools such as neural machine translation and large language models have the potential to foster a more multilingual scholarly communication ecosystem, but they are not a panacea. This lecture weighs some of the gains and losses, as well as the challenges and opportunities, that arise when we apply AI to multilingual scholarly publishing.

Suggested pre-reading:
(1) Amano, T. et al. 2023. The manifold costs of being a non-native English speaker in science. PLoS Biology 21(7): e3002184. https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.3002184
(2) Bowker, L., M. Laakso, and J. Pölönen. 2025. Making the case for multilingual scholarly communication. Canadian Journal of Information and Library Science 48(1): 112-116. https://doi.org/10.5206/cjils-rcsib.v48i1.22292

Workshop: Developing a (mini) business case for multilingual scholarly events
This workshop issues a challenge to participants. Imagine that you are going to organize a multilingual one-day student event in Digital Humanities. Some of the aspects that you need to consider include how many (and which) languages will be included, how to manage a multilingual website and call for proposals, conducting multilingual peer review of submissions, supporting the delivery of posters and/or presentations in different languages, and publishing multilingual proceedings. Working in groups, participants will consider which types of AI tools could be used to support these tasks, where AI may be less desirable, and what other solutions could be used instead. How or where might AI tools be integrated into the workflow? What AI-related policies might need to be developed or implemented? What justifications can you provide for your decisions?

Suggested pre-reading:
(1) Burton-Jones, Andrew, et al. 2025. This article is not just in English: Making science more inclusive and impactful with artificial intelligence translation. Australasian Journal of Information Systems 29. https://doi.org/10.3127/ajis.v29.5875
(2) Warburton, Kara. 2024. Developing a business case for managing terminology. http://termologic.com/wp-content/uploads/2024/05/roi-article-warburton.pdf

Tuesday June 9^th – Afternoon Session

Marcus Müller (TU Darmstadt)
Lecture: Natural Meaning and Artificial Intelligence. A View from Corpus Linguistics
In my lecture, I will report on experiments in Word Sense Disambiguation with Large Language Models. Academic discourse relies on clear, accurate and contextually abstract terminology. However, unambiguous terms are more commonly found in DIY stores than in academic articles. This is because academic terms often refer to abstract entities that evolve within discourse. Fundamental terms such as “structure”, “norm” and “system” for example, are further developed and redefined in academic discourse, migrating from one disciplinary field to another and thus changing their scope of reference. This is particularly evident in the humanities and social sciences.
Our example terms are drawn from the field of political science, specifically International Relations research, which is characterised by two features: Firstly, it involves theorising at a high level of abstraction. Secondly, it establishes the relevance of the research by referencing concrete events and institutions in international politics. This creates a hybrid context for terminologisation, in which words such as “cooperation”, “discourse”, “norm” or “anarchy” are often used in the same text with the meanings of both everyday political language and specialised terminology.
We conducted the experiments within the interdisciplinary project Terminological Innovation in International Relations, which traces the trajectories of key political science terms — such as “regime”, “governance”, and “international anarchy” — as they circulate between academic discourse, policy consulting, and practical politics (1976–2000).

Workshop: Natural Meaning and Artificial Intelligence. A View from Corpus Linguistics
In the workshop that follows, I would like to discuss Word Sense Disambiguation with you. We will look at the opportunities and risks of using Large Language Models for this purpose from a linguistic perspective. To this end, we will carry out some short exercises involving artificial and natural intelligence. The platform we will use is the Cloudflare AI playground: https://playground.ai.cloudflare.com/
I will provide you with test data and instructions just before the workshop starts.

Suggested pre-reading:
(1) Kroeger, Paul R. Analyzing Meaning: An Introduction to Semantics and Pragmatics. Language Science Press, 2023. https://openresearchlibrary.org/content/59084ac7-e512-46df-abb4-9280bc5f9696 (chap. 5: Word Senses, pp. 75-104)
(2) Basile, Pierpaolo, Lucia Siciliani, Elio Musacchio, and Giovanni Semeraro. 2025. “Exploring the Word Sense Disambiguation Capabilities of Large Language Models.” https://arxiv.org/abs/2503.08662

Thursday June 11^th – Morning Session

Nataliia Laba (University of Groningen)
Lecture: Text-image collapse: the challenge of multimodal generative AI
One of the challenges of multimodal generative AI concerns how we create and make sense of images. Although AI-generated images have attracted widespread public attention only relatively recently, decades of theoretical and artistic work have already laid the groundwork for understanding the image as a computational object – technical (Flusser), algorithmic (Somaini), networked (Dewdney & Sluis), or operational (Parikka). Building on these genealogies, this lecture addresses the new conceptual and methodological challenge of text-image collapse introduced by multimodal generative AI. Our focus is on three questions:

What can platform affordances teach us about multimodal generative AI?
What can user prompting practices reveal about how people engage with this technology?
How can AI-generated images be analyzed as part of visual generative communication?

Two kinds of examples are used: aesthetic remediation, exploring how artistic styles are absorbed and repurposed by generative AI models, and political communication, examining how war and conflict are represented and mediated through multimodal generative AI.

Workshop:Text-image collapse: resolving some of the challenges of multimodal generative AI
In this hands-on workshop, we will examine how text-image collapse in multimodal generative AI applies to actual data, and what kinds of insights can be found through the study of generative platform affordances, prompts, and AI-generated images themselves. We will build system networks to map the low-level affordances of selected image generators and will distinguish between different kinds of prompting strategies. We will also discuss the relationships between prompt specificity and generative renderings, considering how different levels of textual detail shape visual outputs.

Suggested pre-reading:
(1) Bajohr, H. (2024). Operative ekphrasis: The collapse of the text/image distinction in multimodal AI. Word & Image, 40(2), 77–90. https://doi.org/10.1080/02666286.2024.2330335
(2) Laba, N., & Bouko, C. (2026). Introduction: Making sense of AI-generated images. In C. Bouko & N. Laba (Eds.), Six critical lenses on AI-generated images (pp. 1–24). CRC Press. https://doi.org/10.1201/9781003740261-1
(3) Weatherby, L., & Justie, B. (2022). Indexical AI. Critical Inquiry, 48(2), 381–415. https://doi.org/10.1086/717312

Thursday June 11^th – Afternoon Session

Wu Ping (Beijing Language and Culture University)
Lecture: Large Language Models for Emotion Analysis in Literary Translation: A Case Study of Yu Hua’s “To Live”
This study investigates how large language models (LLMs) can support theory-guided emotion analysis in literary translation. Drawing on Appraisal Theory, we develop an annotation workflow that operationalises evaluative meaning across the Affect, Engagement, and Graduation subsystems and apply it to Yu Hua’s novel To Live. The analysis compares the Chinese source text, Michael Berry’s English translation, and the translations generated by contemporary LLMs. We adopt a theory-guided prompting strategy to produce structured evaluative annotations and calibrate the procedure against a
human-annotated gold-standard subset to assess annotation reliability and the effects of prompt design. The results show that theory-informed prompting improves the stability of appraisal-based annotation and enables systematic comparison of evaluative patterns across translation conditions.

Suggested pre-reading:
(1) Rebora, S. (2023). Sentiment analysis in literary studies. a critical survey. Digital Humanities Quarterly, 17(2): 1-17.
(2) Martin, J. R. &White, P. R. (2005). The Language of Evaluation: Appraisal in English. Palgrave Macmillan, London ＆New York: Palgrave Macmillan.

Panel:

Sun Hongbo, Corpus-Driven Evidence of Syntactic Simplification and Lexical Explicitation in Translational English
Abstract: This study investigates whether translational English exhibits systematic syntactic simplification and lexical explicitation compared to native English, using two large-scale parallel and comparable corpora. Following a corpus-driven, multi-dimensional analytic framework, it employs quantitative metrics—including T-unit complexity, type-token ratio, and collocational density—alongside qualitative annotation to identify cross-linguistic patterns. Results reveal a significant reduction in T-unit length (p < .01), overreliance on high-frequency collocations, and increased semantic explicitness via redundant modifiers. These features are closely linked to source language interference and translational norms. The findings contribute empirical evidence to the ongoing debate on translation universals, while challenging the assumed universality of simplification by revealing its collocational conditioning. This study also offers pedagogical implications for EFL collocation instruction and practical guidance for corpus-based translation quality assessment. It demonstrates the value of data-driven methods in uncovering latent features of translated language and reinforces the role of corpus linguistics in translation studies.

Li Qian, From Passive Retrieval to Critical Curation: Redefining the Literature Review in the Age of AI
Abstract: The rapid integration of Generative AI (GenAI) into academic workflows has transformed the literature review from a labor-intensive process of manual discovery into a high-speed exercise in automated synthesis. However, this shift presents a “double-edged sword” for Digital Humanities: while AI offers unprecedented efficiency in mapping vast scholarly corpora, it risks fostering intellectual laziness by replacing deep cognitive engagement with passive retrieval. As scholars, we face a critical juncture: is AI expanding our research horizons, or is it hollowing out the analytical foundations of our work?
The presenter observes a unique cultural and linguistic dimension to this debate. For non-native English-speaking (L2) researchers, AI serves as a powerful bridge, lowering the linguistic barriers to international publication and academic discourse. Yet, this reliance introduces the risk of “Linguistic Homogenization.” Because most Large Language Models are trained on Western-centric datasets, they tend to standardize the unique rhetorical voices and cultural nuances of Chinese scholarship into global academic tropes. When the AI “refines” the language, it may inadvertently erase the original perspective of the researcher, leading to a loss of epistemic diversity in the global humanities.
To manage these risks, this presentation proposes a transition from “Passive Retrieval” toward a “Critical Curation” framework. This model moves away from treating AI as an interpretive replacement and instead positions it as a structural assistant within a “human-in-the-loop” system. I introduce practical management strategies, including “Triangulated Verification”—cross-referencing AI outputs against multiple models and primary sources—and the “Audit Trail” method for transparent AI disclosure. By redefining the scholar’s role as a curator of synthesized knowledge rather than a consumer of automated summaries, we ensure that digital communication remains a rigorous, human-centered endeavor.

Cao Yanli, The Limits of Mechanical Reasoning in the Generation of Embodied Metaphors by Large Language Models
Abstract: Embodied cognition theory posits that abstract concepts are rooted in sensorimotor experience. This study investigates whether disembodied Large Language Models (LLMs) can authentically simulate such embodied metaphors. Utilizing GPT-5.4, we conducted generative probe tasks covering seven core body parts: heart, head, eye, mouth, hand, feet, and body. These body parts cover multiple phenomenological dimensions, including emotional experience, proprioception, and sensorimotor processes, which comprehensively reflect the scope of embodied experience. For the analytical approach, this study conducts a qualitative analysis of the generated results from the dimensions of embodiment intensity, richness of perceptual details, dynamicity of experience, and compatibility with cultural contexts, etc. Results indicate that the model demonstrates a sophisticated command of the functional mappings between physical attributes and abstract domains. However, qualitative analysis suggests a potential disconnect between structural logic and visceral intuition. Specifically, while metaphors involving muscle tension appear natural, those describing cognitive states often exhibit traits of “mechanical reasoning.” These findings imply that GPT-5.4 has likely acquired the syntactic logic of embodied language but may not yet fully possess the experiential semantics of sensation. The model functions more as a rational observer of physical rules than as an authentic embodied experiencer.

Friday June 12^th – Morning Session

Niall Curry (Manchester Metropolitan University)
Lecture: A critical reflection on GenAI use in applied linguistics research
The role of Generative AI (GenAI) in the research process has emerged as a key topic of critical debate in corpus linguistics. For every proposed boon that the use of GenAI heralds, there is a complementary bane, and while both the body of research on Gen AI use and research in corpus linguistics using GenAI continues to grow, we do not appear to have arrived at any clear consensus surrounding its affordances and limitations. In this talk, I draw on some recent work that addresses the issue of GenAI use in corpus linguistics research. The talk spotlights some of the key ideas emerging from this work, addressing questions of GenAI literacy, ethics, knowledge-making, and the relevance of large language models for corpus linguistics research. Through this exploration of emergent key issues, I reflect on the ‘goodness of the fit’ of GenAI for our research activity and consider the research areas in which the application of GenAI may be a) ineffective, b) antithetical to our research agenda, or c) pose some opportunity for research and knowledge-making.

Workshop: Applying a human perspective to GenAI use in the research process
Building on the Lecture, I will provide a case study demonstrating the application of the established framework to language teaching and learning. I will then ask participants to reflect on the application of this framework to research of their own design. This will afford participants an opportunity to localise this critical reflection within their own research paradigms and determine the affordances and limitations of GenAI therein. Participants will have an opportunity to present their reflections and get feedback on their proposed research and engagement with GenAI.

Suggested pre-reading:
(1) Curry, N., McEnery, T., & Brookes, G. (2025). A question of alignment – AI, GenAI and applied linguistics. Annual Review of Applied Linguistics, 45, 315-336. https://doi.org/10.1017/S0267190525000017
(2) Pérez-Paredes, P., Curry N., & Aguado Jiménez, P. (2025). Integrating critical corpus and AI literacies in applied linguistics: A mixed-methods study. Computer Assisted Language Learning, 1-27. https://doi.org/10.1080/09588221.2025.2569351
(3) Pérez-Paredes, P., Curry N., & Ordoñana-Guillamón, C. (2025). Critical AI literacy for applied linguistics and language education students. Journal of China Computer-Assisted Language Learning, 5(1), 1-40. https://doi.org/10.1515/jccall-2025-0005
(4) Curry, N., Baker, P., & Brookes, G. (2024). Generative AI for corpus approaches to discourse studies: A critical evaluation of ChatGPT. Applied Corpus Linguistics, 4(1), 1-9. https://doi.org/10.1016/j.acorp.2023.100082