The 8th Workshop on Clinical Natural Language Processing

At LREC 2026. Palma, Mallorca (Spain). Sat 16 May 2026.

Workshop Program

Sat 16 May 2026 (all times CEST)

09:00–09:45 Opening
09:00–09:30 Keynote: Holistic engagement beyond the edges of data-driven science: The case of Libya
Speaker: Dr. Stephen Wu
Abstract: We often make some assumptions about the ethics (e.g., individual rights) and structures (e.g., research enterprise) supporting health technologies. This talk argues that, for societies like Libya's, some paradigm shifts are in order. First, we contrast ethical values between honor vs. dignity cultures. Second, we highlight aspects of the Libyan research system that fundamentally change what work needs to be done in health technology research. Subsequently, we describe several recent holistic, contextually situated efforts at engagement: private research sponsorship and administration, science of science research, and care practice norms and information flow in medicine.
Biography: Dr. Stephen Wu is an Associate Professor at Saraya Hamra University (SHU) in Tripoli, Libya, where he serves as the Director for the Center for Research & Innovation as well as the Head of the Department of Computer Science & Engineering. His current interests lie in empowering the Libyan innovation system through the science of science, the interaction of global value systems with modern AI, and culturally situated healthcare information systems.
09:30–09:45 Keynote Q&A
09:45–10:00 MEDIQA-EVAL 2026 Shared Task
09:45–09:55 Overview of the MEDIQA-EVAL 2026 Shared Task on Evaluation Metrics in Medical Multimodal Question Answering
Asma Ben Abacha and Wen-wai Yim
09:55–10:00 SUAT-BMI at MEDIQA-EVAL 2026: An Ensemble Approach to Language Models as Judges for Automatic Rating of Medical Responses
Xinzhe Peng, Liyuan E, Kun Feng, Jielin Li, Yuxuan Tang and Zhao Li
10:00–10:15 MEDIQA-SYNUR 2026 Shared Task
10:00–10:10 Overview of the MEDIQA-SYNUR 2026 Shared Task on Observation Extraction from Nurse Dictations
George Michalopoulos, Jean-Philippe Corbeil, Cari Bader, Nathan Bodenstab and Asma Ben Abacha
10:10–10:15 SemAnTICA Lab at MediQA-SYNUR 2026: Route, Extract and Verify -- An LLM-gated Ensemble for Parsing Nurse Dictations
Sy Hwang, Katherine S. Pitcher, Sue Hyon Kim, Yoonjae Lee, Hayoung K. Donelly, Harsh Bandhey, Andrew J. King, Karen O'Connor, Ryan J. Urbanowicz and Danielle L. Mowery
10:15–11:15 Poster Session
L2D-Clinical: Learning to Defer for Adaptive Model Selection in Clinical Text Classification
Rishik Kondadadi and John E. Ortega
TRUMEDIQA: A Modular Trustworthy RAG Pipeline for Multilingual Medical Question Answering
Jihad Zahir, Ayoub Nainia and Meryem El Fatimi
Evaluating the Retrieval Component in a Retrieval-Augmented Summarization System for Patient Records in French
Marco Naguib, Christel Gérardin, Victor Beaucoté, Cyril Charron, Adrien Joseph, Aurélie Névéol and Xavier Tannier
Retrieval-Augmented Generation Based Nurse Observation Extraction
Kyomin Hwang and Nojun Kwak
Automatic Generation of Discharge Summaries Using Large Language Models: A Systematic Literature Review
Lucas Molino-Piñar, Manuel Carlos Diaz Galiano and María-Teresa Martín-Valdivia
Smart_solutions at MEDIQA-SYNUR 2026: A Multi-Stage LLM Pipeline for Nursing Observation Extraction
Prateek Munjal
GS-BrainText: A Multi-Site Brain Imaging Report Dataset from Generation Scotland for Clinical Natural Language Processing Development and Validation
Beatrice Alex, Claire Grover, Arlene Casey, Richard Tobin, Heather Whalley and William Whiteley
SASTA Self Assessment: An efficient human-in-the-loop strategy for developmental and pathological language analysis
Jan Odijk, Jelte van Boheemen, Xander Vertegaal, Tessel Boerma and Marijn Schraagen
Differentially Private De-identification of Dutch Clinical Notes: A Comparative Evaluation
Michele Miranda, Xinlan Yan, Nishant Mishra, Rachel Murphy, Ameen Abu Hanna, Sébastien Bratières and Iacer Calixto
SQUCS at MEDIQA-SYNUR 2026: A Multi-Agent Open Source LLM System for Nursing Observation Extraction
Riham JeebAllah, Adhari AlZaabi and Abdulrahman Khalifa AAlAbdulsalam
LTRC-IIIT at MEDIQA-SYNUR 2026: Benchmarking a Fully Local, Training-Free RAG Pipeline
Aashwin Vaish and Dipti Misra Sharma
BDI at MEDIQA-EVAL 2026: A ReAct-Style Multimodal Agent for Fine-Grained Medical Response Assessment
Justin Xu, Zizheng Zhang, Augustine Luk, Benjamin Khong, Haochen Cui, Samuel Hwang, Alyssa Pradhan, Kevin Yuan and David W. Eyre
MasonNLP at MEDIQA-SYNUR 2026: Retrieval-Augmented Large Language Models for Schema-Constrained Clinical Information Extraction
A H M Rezaul Karim and Özlem Uzuner
Night Shift Nerds at MEDIQA-SYNUR 2026: Pushing Small Large Language Model Capability for Clinical Observation Extraction and Normalization from Nurse Dictation using RLVR
Bayu Aryoyudanta, Maria Yuliana, Mikie Rachman and I Made Agus Setiawan
Extracting Medication Instructions from Dutch General Practice Electronic Health Records with Local Natural Language Processing
Marya Dukmak, Constanza L. Andaur Navarro and Artuur Leeuwenberg
Gladiator at MEDIQA-SYNUR 2026: Contextual Clinical Extraction: Integrating Foundation Models with Domain-Specific Validation Rules
Siva Satyanarayana Raju Pusapati and Ankit Singh
MedAware at MEDIQA-EVAL 2026: Vision-Language Model Fine-Tuning with Logprob-Based Score Calibration for Medical Response Evaluation
Ziqi Hao and Pengbo Liu
HSE NLP TEAM at MEDIQA-SYNUR 2026: Consensus Adjudication Ensemble (ACE): Balancing Precision and Recall for Schema-Bystander Clinical Extraction
Airat A. Valiev
Lakefront AI Ramblers at MEDIQA-SYNUR 2026: Hybrid Retrieval and LLM Verification for Open-Source Schema-Guided Clinical Information Extraction
Michael T. Saban, Arsalan Yaghoubi, Behnaz Eslami, Samie Tootooni and Dmitriy Dligach
JMedWiC: A Japanese Word-in-Context Dataset in the Medical Domain
Koki Horiguchi, Seiji Sugiyama, Tomoyuki Kajiwara, Shoko Wakamiya and Eiji ARAMAKI
LTRC-Medicom at MEDIQA-SYNUR 2026: Schema-Guided Clinical Information Extraction with Hybrid Clustering-SFT-Verification
Pasumarthy Deepak, Sushvin Marimuthu and Parameswari Krishnamurthy
SloCal-Net at MEDIQA-Eval 2026: Investigating the Impact of Reasoning and External Context on Medical Answer Grading
Primoz Kocbek, Valentina Carbonari, Pierangelo Veltri, Pietro Hiram Guzzi and Gregor Stiglic
MIDAS_SYNUR at MEDIQA-SYNUR 2026: A Prompting Study for Clinical Observation Extraction from Nurse Dictation Transcriptions
Swetha Krishna Sriram and Akshitaa Sahoo
AnotherOne at MEDIQA-SYNUR 2026: Detect, Extract, Normalize - Knowledge-Grounded LLM Pipeline for Clinical Observation Extraction
Jerrin John Thomas and Parameswari Krishnamurthy
hgkai26 at MEDIQA-EVAL 2026: Automated Evaluation of Visual Medical Question Answering Using LLM-as-a-Judge
Haritha Gangavarapu
Role-Adapted Clinical Report Generation for Ultrasound Measurements in Low-Resource Settings
Ayoub Nainia, Tanya Akumu, Noussair Lazrak and Karim Lekadir
A Comparative Study of Approaches to Anonymization of Clinical Free Text in Spanish
Florencia Luciana Brunello, Laura Alonso Alemany, Serena Villata and Milagro Teruel
Disagreement-Driven Joint Refinement of Retrieval and Decision Rules for Imbalanced Counseling Risk Classification
Zhihao Shao, Ryo Sekizaki, Shengzhou Yi and Toshihiko Yamasaki
Context-Aware SNOMED CT Entity Linking for Clinical Text
Provia Kadusabe, Demian Gholipour Ghalandari, Lauren Cassidy, Jack Boylan, Chris Hokamp, Abhishek Kaushik and Fiona Lawless
Temporal Structure in Clinical Narratives in Portuguese: Insights from Cross-Document Annotation
Ana Luisa Fernandes, Purificação Silvano and Luís Filipe Cunha
11:15–11:45 Special Track on Low Resource Settings
11:15–11:30 MOSAIC: A Multilingual, Taxonomy-Agnostic, and Computationally Efficient Approach for Radiological Report Classification in Low-Resource Settings
Alice Schiavone, Marco Fraccaro, Lea Marie Pehrson, Silvia Ingala, Rasmus Bonnevie, Michael Bachmann Nielsen, Vincent Beliveau, Melanie Ganz and Desmond Elliott
11:30–11:45 MedNormJ: A Benchmark Dataset for Medical Concept Normalization in Japanese Clinical Documents
Yuki Tashiro, Seiji Shimizu, Tomohiro Nishiyama, Shoko Wakamiya and Eiji ARAMAKI
11:45–12:15 Special Track on Construction or Deconstruction of LLMs
11:45–12:00 Pediatric Sepsis Cohort Detection Using In-Context Pointwise V-Usable Information
Yingya Li, Alon Geva, Steven Bethard, Timothy A. Miller, Kate Madden, Matthew A. Eisenberg, Daniel P. Kelly and Guergana Savova
12:00–12:15 RECAP: Transparent Inference-Time Emotion Alignment for Medical Dialogue Systems
Adarsh Srinivasan, Jacob Dineen, Muhammad Uzair Sarfraz, Muhammad Umar Afzal, Irbaz Riaz and Ben Zhou
12:15–13:00 Main Track
12:15–12:30 Disentangling Ambiguity from Instability in Large Language Models: A Clinical Text-to-SQL Case Study
Angelo Ziletti and Leonardo D'Ambrosi
12:30–12:45 An OMOP-Based Open-Source Text-to-SQL Benchmark Dataset
Paul Legrand, Kawsar Noor, Satyam Bhagwanani and Richard J. Dobson
12:45–13:00 Profiling Hallucinations in Frontier LLMs for Entity Linking to Medical Ontologies
Logan Born, Nishant Kambhatla, Uliyana Kubasova, Maryam Siahbani, Andrei Vacariu, Timothy W. O'Connell and Anoop Sarkar
13:00–13:00 Closing