Clinical NLP Workshop 2026

Workshop Program

Sat 16 May 2026 (all times CEST)
09:00–09:45	Opening
09:00–09:30	Keynote: Holistic engagement beyond the edges of data-driven science: The case of Libya Speaker: Dr. Stephen Wu Abstract: We often make some assumptions about the ethics (e.g., individual rights) and structures (e.g., research enterprise) supporting health technologies. This talk argues that, for societies like Libya's, some paradigm shifts are in order. First, we contrast ethical values between honor vs. dignity cultures. Second, we highlight aspects of the Libyan research system that fundamentally change what work needs to be done in health technology research. Subsequently, we describe several recent holistic, contextually situated efforts at engagement: private research sponsorship and administration, science of science research, and care practice norms and information flow in medicine. Biography: Dr. Stephen Wu is an Associate Professor at Saraya Hamra University (SHU) in Tripoli, Libya, where he serves as the Director for the Center for Research & Innovation as well as the Head of the Department of Computer Science & Engineering. His current interests lie in empowering the Libyan innovation system through the science of science, the interaction of global value systems with modern AI, and culturally situated healthcare information systems.
09:30–09:45	Keynote Q&A
09:45–10:00	MEDIQA-EVAL 2026 Shared Task
09:45–09:55	Overview of the MEDIQA-EVAL 2026 Shared Task on Evaluation Metrics in Medical Multimodal Question Answering Asma Ben Abacha and Wen-wai Yim
09:55–10:00	SUAT-BMI at MEDIQA-EVAL 2026: An Ensemble Approach to Language Models as Judges for Automatic Rating of Medical Responses Xinzhe Peng, Liyuan E, Kun Feng, Jielin Li, Yuxuan Tang and Zhao Li
10:00–10:15	MEDIQA-SYNUR 2026 Shared Task
10:00–10:10	Overview of the MEDIQA-SYNUR 2026 Shared Task on Observation Extraction from Nurse Dictations George Michalopoulos, Jean-Philippe Corbeil, Cari Bader, Nathan Bodenstab and Asma Ben Abacha
10:10–10:15	SemAnTICA Lab at MediQA-SYNUR 2026: Route, Extract and Verify -- An LLM-gated Ensemble for Parsing Nurse Dictations Sy Hwang, Katherine S. Pitcher, Sue Hyon Kim, Yoonjae Lee, Hayoung K. Donelly, Harsh Bandhey, Andrew J. King, Karen O'Connor, Ryan J. Urbanowicz and Danielle L. Mowery
10:15–11:15	Poster Session
	L2D-Clinical: Learning to Defer for Adaptive Model Selection in Clinical Text Classification Rishik Kondadadi and John E. Ortega
	TRUMEDIQA: A Modular Trustworthy RAG Pipeline for Multilingual Medical Question Answering Jihad Zahir, Ayoub Nainia and Meryem El Fatimi
	Evaluating the Retrieval Component in a Retrieval-Augmented Summarization System for Patient Records in French Marco Naguib, Christel Gérardin, Victor Beaucoté, Cyril Charron, Adrien Joseph, Aurélie Névéol and Xavier Tannier
	Retrieval-Augmented Generation Based Nurse Observation Extraction Kyomin Hwang and Nojun Kwak
	Automatic Generation of Discharge Summaries Using Large Language Models: A Systematic Literature Review Lucas Molino-Piñar, Manuel Carlos Diaz Galiano and María-Teresa Martín-Valdivia
	Smart_solutions at MEDIQA-SYNUR 2026: A Multi-Stage LLM Pipeline for Nursing Observation Extraction Prateek Munjal
	GS-BrainText: A Multi-Site Brain Imaging Report Dataset from Generation Scotland for Clinical Natural Language Processing Development and Validation Beatrice Alex, Claire Grover, Arlene Casey, Richard Tobin, Heather Whalley and William Whiteley
	SASTA Self Assessment: An efficient human-in-the-loop strategy for developmental and pathological language analysis Jan Odijk, Jelte van Boheemen, Xander Vertegaal, Tessel Boerma and Marijn Schraagen
	Differentially Private De-identification of Dutch Clinical Notes: A Comparative Evaluation Michele Miranda, Xinlan Yan, Nishant Mishra, Rachel Murphy, Ameen Abu Hanna, Sébastien Bratières and Iacer Calixto
	SQUCS at MEDIQA-SYNUR 2026: A Multi-Agent Open Source LLM System for Nursing Observation Extraction Riham JeebAllah, Adhari AlZaabi and Abdulrahman Khalifa AAlAbdulsalam
	LTRC-IIIT at MEDIQA-SYNUR 2026: Benchmarking a Fully Local, Training-Free RAG Pipeline Aashwin Vaish and Dipti Misra Sharma
	BDI at MEDIQA-EVAL 2026: A ReAct-Style Multimodal Agent for Fine-Grained Medical Response Assessment Justin Xu, Zizheng Zhang, Augustine Luk, Benjamin Khong, Haochen Cui, Samuel Hwang, Alyssa Pradhan, Kevin Yuan and David W. Eyre
	MasonNLP at MEDIQA-SYNUR 2026: Retrieval-Augmented Large Language Models for Schema-Constrained Clinical Information Extraction A H M Rezaul Karim and Özlem Uzuner
	Night Shift Nerds at MEDIQA-SYNUR 2026: Pushing Small Large Language Model Capability for Clinical Observation Extraction and Normalization from Nurse Dictation using RLVR Bayu Aryoyudanta, Maria Yuliana, Mikie Rachman and I Made Agus Setiawan
	Extracting Medication Instructions from Dutch General Practice Electronic Health Records with Local Natural Language Processing Marya Dukmak, Constanza L. Andaur Navarro and Artuur Leeuwenberg
	Gladiator at MEDIQA-SYNUR 2026: Contextual Clinical Extraction: Integrating Foundation Models with Domain-Specific Validation Rules Siva Satyanarayana Raju Pusapati and Ankit Singh
	MedAware at MEDIQA-EVAL 2026: Vision-Language Model Fine-Tuning with Logprob-Based Score Calibration for Medical Response Evaluation Ziqi Hao and Pengbo Liu
	HSE NLP TEAM at MEDIQA-SYNUR 2026: Consensus Adjudication Ensemble (ACE): Balancing Precision and Recall for Schema-Bystander Clinical Extraction Airat A. Valiev
	Lakefront AI Ramblers at MEDIQA-SYNUR 2026: Hybrid Retrieval and LLM Verification for Open-Source Schema-Guided Clinical Information Extraction Michael T. Saban, Arsalan Yaghoubi, Behnaz Eslami, Samie Tootooni and Dmitriy Dligach
	JMedWiC: A Japanese Word-in-Context Dataset in the Medical Domain Koki Horiguchi, Seiji Sugiyama, Tomoyuki Kajiwara, Shoko Wakamiya and Eiji ARAMAKI
	LTRC-Medicom at MEDIQA-SYNUR 2026: Schema-Guided Clinical Information Extraction with Hybrid Clustering-SFT-Verification Pasumarthy Deepak, Sushvin Marimuthu and Parameswari Krishnamurthy
	SloCal-Net at MEDIQA-Eval 2026: Investigating the Impact of Reasoning and External Context on Medical Answer Grading Primoz Kocbek, Valentina Carbonari, Pierangelo Veltri, Pietro Hiram Guzzi and Gregor Stiglic
	MIDAS_SYNUR at MEDIQA-SYNUR 2026: A Prompting Study for Clinical Observation Extraction from Nurse Dictation Transcriptions Swetha Krishna Sriram and Akshitaa Sahoo
	AnotherOne at MEDIQA-SYNUR 2026: Detect, Extract, Normalize - Knowledge-Grounded LLM Pipeline for Clinical Observation Extraction Jerrin John Thomas and Parameswari Krishnamurthy
	hgkai26 at MEDIQA-EVAL 2026: Automated Evaluation of Visual Medical Question Answering Using LLM-as-a-Judge Haritha Gangavarapu
	Role-Adapted Clinical Report Generation for Ultrasound Measurements in Low-Resource Settings Ayoub Nainia, Tanya Akumu, Noussair Lazrak and Karim Lekadir
	A Comparative Study of Approaches to Anonymization of Clinical Free Text in Spanish Florencia Luciana Brunello, Laura Alonso Alemany, Serena Villata and Milagro Teruel
	Disagreement-Driven Joint Refinement of Retrieval and Decision Rules for Imbalanced Counseling Risk Classification Zhihao Shao, Ryo Sekizaki, Shengzhou Yi and Toshihiko Yamasaki
	Context-Aware SNOMED CT Entity Linking for Clinical Text Provia Kadusabe, Demian Gholipour Ghalandari, Lauren Cassidy, Jack Boylan, Chris Hokamp, Abhishek Kaushik and Fiona Lawless
	Temporal Structure in Clinical Narratives in Portuguese: Insights from Cross-Document Annotation Ana Luisa Fernandes, Purificação Silvano and Luís Filipe Cunha
11:15–11:45	Special Track on Low Resource Settings
11:15–11:30	MOSAIC: A Multilingual, Taxonomy-Agnostic, and Computationally Efficient Approach for Radiological Report Classification in Low-Resource Settings Alice Schiavone, Marco Fraccaro, Lea Marie Pehrson, Silvia Ingala, Rasmus Bonnevie, Michael Bachmann Nielsen, Vincent Beliveau, Melanie Ganz and Desmond Elliott
11:30–11:45	MedNormJ: A Benchmark Dataset for Medical Concept Normalization in Japanese Clinical Documents Yuki Tashiro, Seiji Shimizu, Tomohiro Nishiyama, Shoko Wakamiya and Eiji ARAMAKI
11:45–12:15	Special Track on Construction or Deconstruction of LLMs
11:45–12:00	Pediatric Sepsis Cohort Detection Using In-Context Pointwise V-Usable Information Yingya Li, Alon Geva, Steven Bethard, Timothy A. Miller, Kate Madden, Matthew A. Eisenberg, Daniel P. Kelly and Guergana Savova
12:00–12:15	RECAP: Transparent Inference-Time Emotion Alignment for Medical Dialogue Systems Adarsh Srinivasan, Jacob Dineen, Muhammad Uzair Sarfraz, Muhammad Umar Afzal, Irbaz Riaz and Ben Zhou
12:15–13:00	Main Track
12:15–12:30	Disentangling Ambiguity from Instability in Large Language Models: A Clinical Text-to-SQL Case Study Angelo Ziletti and Leonardo D'Ambrosi
12:30–12:45	An OMOP-Based Open-Source Text-to-SQL Benchmark Dataset Paul Legrand, Kawsar Noor, Satyam Bhagwanani and Richard J. Dobson
12:45–13:00	Profiling Hallucinations in Frontier LLMs for Entity Linking to Medical Ontologies Logan Born, Nishant Kambhatla, Uliyana Kubasova, Maryam Siahbani, Andrei Vacariu, Timothy W. O'Connell and Anoop Sarkar
13:00–13:00	Closing

The 8th Workshop on Clinical Natural Language Processing

At LREC 2026. Palma, Mallorca (Spain). Sat 16 May 2026.

Workshop Program

Sat 16 May 2026 (all times CEST)