Sat 16 May 2026 (all times CEST) |
|
| 09:00–09:45 | Opening |
| 09:00–09:30 | Keynote: Holistic engagement beyond the edges of data-driven science: The case of Libya
Speaker: Dr. Stephen Wu Abstract: We often make some assumptions about the ethics (e.g., individual rights) and structures (e.g., research enterprise) supporting health technologies. This talk argues that, for societies like Libya's, some paradigm shifts are in order. First, we contrast ethical values between honor vs. dignity cultures. Second, we highlight aspects of the Libyan research system that fundamentally change what work needs to be done in health technology research. Subsequently, we describe several recent holistic, contextually situated efforts at engagement: private research sponsorship and administration, science of science research, and care practice norms and information flow in medicine. Biography: Dr. Stephen Wu is an Associate Professor at Saraya Hamra University (SHU) in Tripoli, Libya, where he serves as the Director for the Center for Research & Innovation as well as the Head of the Department of Computer Science & Engineering. His current interests lie in empowering the Libyan innovation system through the science of science, the interaction of global value systems with modern AI, and culturally situated healthcare information systems. |
| 09:30–09:45 | Keynote Q&A |
| 09:45–10:00 | MEDIQA-EVAL 2026 Shared Task |
| 09:45–09:55 | Overview of the MEDIQA-EVAL 2026 Shared Task on Evaluation Metrics in Medical Multimodal Question Answering Asma Ben Abacha and Wen-wai Yim |
| 09:55–10:00 | SUAT-BMI at MEDIQA-EVAL 2026: An Ensemble Approach to Language Models as Judges for Automatic Rating of Medical Responses Xinzhe Peng, Liyuan E, Kun Feng, Jielin Li, Yuxuan Tang and Zhao Li |
| 10:00–10:15 | MEDIQA-SYNUR 2026 Shared Task |
| 10:00–10:10 | Overview of the MEDIQA-SYNUR 2026 Shared Task on Observation Extraction from Nurse Dictations George Michalopoulos, Jean-Philippe Corbeil, Cari Bader, Nathan Bodenstab and Asma Ben Abacha |
| 10:10–10:15 | SemAnTICA Lab at MediQA-SYNUR 2026: Route, Extract and Verify -- An LLM-gated Ensemble for Parsing Nurse Dictations Sy Hwang, Katherine S. Pitcher, Sue Hyon Kim, Yoonjae Lee, Hayoung K. Donelly, Harsh Bandhey, Andrew J. King, Karen O'Connor, Ryan J. Urbanowicz and Danielle L. Mowery |
| 10:15–11:15 | Poster Session |
| L2D-Clinical: Learning to Defer for Adaptive Model Selection in Clinical Text Classification Rishik Kondadadi and John E. Ortega |
|
| TRUMEDIQA: A Modular Trustworthy RAG Pipeline for Multilingual Medical Question Answering Jihad Zahir, Ayoub Nainia and Meryem El Fatimi |
|
| Evaluating the Retrieval Component in a Retrieval-Augmented Summarization System for Patient Records in French Marco Naguib, Christel Gérardin, Victor Beaucoté, Cyril Charron, Adrien Joseph, Aurélie Névéol and Xavier Tannier |
|
| Retrieval-Augmented Generation Based Nurse Observation Extraction Kyomin Hwang and Nojun Kwak |
|
| Automatic Generation of Discharge Summaries Using Large Language Models: A Systematic Literature Review Lucas Molino-Piñar, Manuel Carlos Diaz Galiano and María-Teresa Martín-Valdivia |
|
| Smart_solutions at MEDIQA-SYNUR 2026: A Multi-Stage LLM Pipeline for Nursing Observation Extraction Prateek Munjal |
|
| GS-BrainText: A Multi-Site Brain Imaging Report Dataset from Generation Scotland for Clinical Natural Language Processing Development and Validation Beatrice Alex, Claire Grover, Arlene Casey, Richard Tobin, Heather Whalley and William Whiteley |
|
| SASTA Self Assessment: An efficient human-in-the-loop strategy for developmental and pathological language analysis Jan Odijk, Jelte van Boheemen, Xander Vertegaal, Tessel Boerma and Marijn Schraagen |
|
| Differentially Private De-identification of Dutch Clinical Notes: A Comparative Evaluation Michele Miranda, Xinlan Yan, Nishant Mishra, Rachel Murphy, Ameen Abu Hanna, Sébastien Bratières and Iacer Calixto |
|
| SQUCS at MEDIQA-SYNUR 2026: A Multi-Agent Open Source LLM System for Nursing Observation Extraction Riham JeebAllah, Adhari AlZaabi and Abdulrahman Khalifa AAlAbdulsalam |
|
| LTRC-IIIT at MEDIQA-SYNUR 2026: Benchmarking a Fully Local, Training-Free RAG Pipeline Aashwin Vaish and Dipti Misra Sharma |
|
| BDI at MEDIQA-EVAL 2026: A ReAct-Style Multimodal Agent for Fine-Grained Medical Response Assessment Justin Xu, Zizheng Zhang, Augustine Luk, Benjamin Khong, Haochen Cui, Samuel Hwang, Alyssa Pradhan, Kevin Yuan and David W. Eyre |
|
| MasonNLP at MEDIQA-SYNUR 2026: Retrieval-Augmented Large Language Models for Schema-Constrained Clinical Information Extraction A H M Rezaul Karim and Özlem Uzuner |
|
| Night Shift Nerds at MEDIQA-SYNUR 2026: Pushing Small Large Language Model Capability for Clinical Observation Extraction and Normalization from Nurse Dictation using RLVR Bayu Aryoyudanta, Maria Yuliana, Mikie Rachman and I Made Agus Setiawan |
|
| Extracting Medication Instructions from Dutch General Practice Electronic Health Records with Local Natural Language Processing Marya Dukmak, Constanza L. Andaur Navarro and Artuur Leeuwenberg |
|
| Gladiator at MEDIQA-SYNUR 2026: Contextual Clinical Extraction: Integrating Foundation Models with Domain-Specific Validation Rules Siva Satyanarayana Raju Pusapati and Ankit Singh |
|
| MedAware at MEDIQA-EVAL 2026: Vision-Language Model Fine-Tuning with Logprob-Based Score Calibration for Medical Response Evaluation Ziqi Hao and Pengbo Liu |
|
| HSE NLP TEAM at MEDIQA-SYNUR 2026: Consensus Adjudication Ensemble (ACE): Balancing Precision and Recall for Schema-Bystander Clinical Extraction Airat A. Valiev |
|
| Lakefront AI Ramblers at MEDIQA-SYNUR 2026: Hybrid Retrieval and LLM Verification for Open-Source Schema-Guided Clinical Information Extraction Michael T. Saban, Arsalan Yaghoubi, Behnaz Eslami, Samie Tootooni and Dmitriy Dligach |
|
| JMedWiC: A Japanese Word-in-Context Dataset in the Medical Domain Koki Horiguchi, Seiji Sugiyama, Tomoyuki Kajiwara, Shoko Wakamiya and Eiji ARAMAKI |
|
| LTRC-Medicom at MEDIQA-SYNUR 2026: Schema-Guided Clinical Information Extraction with Hybrid Clustering-SFT-Verification Pasumarthy Deepak, Sushvin Marimuthu and Parameswari Krishnamurthy |
|
| SloCal-Net at MEDIQA-Eval 2026: Investigating the Impact of Reasoning and External Context on Medical Answer Grading Primoz Kocbek, Valentina Carbonari, Pierangelo Veltri, Pietro Hiram Guzzi and Gregor Stiglic |
|
| MIDAS_SYNUR at MEDIQA-SYNUR 2026: A Prompting Study for Clinical Observation Extraction from Nurse Dictation Transcriptions Swetha Krishna Sriram and Akshitaa Sahoo |
|
| AnotherOne at MEDIQA-SYNUR 2026: Detect, Extract, Normalize - Knowledge-Grounded LLM Pipeline for Clinical Observation Extraction Jerrin John Thomas and Parameswari Krishnamurthy |
|
| hgkai26 at MEDIQA-EVAL 2026: Automated Evaluation of Visual Medical Question Answering Using LLM-as-a-Judge Haritha Gangavarapu |
|
| Role-Adapted Clinical Report Generation for Ultrasound Measurements in Low-Resource Settings Ayoub Nainia, Tanya Akumu, Noussair Lazrak and Karim Lekadir |
|
| A Comparative Study of Approaches to Anonymization of Clinical Free Text in Spanish Florencia Luciana Brunello, Laura Alonso Alemany, Serena Villata and Milagro Teruel |
|
| Disagreement-Driven Joint Refinement of Retrieval and Decision Rules for Imbalanced Counseling Risk Classification Zhihao Shao, Ryo Sekizaki, Shengzhou Yi and Toshihiko Yamasaki |
|
| Context-Aware SNOMED CT Entity Linking for Clinical Text Provia Kadusabe, Demian Gholipour Ghalandari, Lauren Cassidy, Jack Boylan, Chris Hokamp, Abhishek Kaushik and Fiona Lawless |
|
| Temporal Structure in Clinical Narratives in Portuguese: Insights from Cross-Document Annotation Ana Luisa Fernandes, Purificação Silvano and Luís Filipe Cunha |
|
| 11:15–11:45 | Special Track on Low Resource Settings |
| 11:15–11:30 | MOSAIC: A Multilingual, Taxonomy-Agnostic, and Computationally Efficient Approach for Radiological Report Classification in Low-Resource Settings Alice Schiavone, Marco Fraccaro, Lea Marie Pehrson, Silvia Ingala, Rasmus Bonnevie, Michael Bachmann Nielsen, Vincent Beliveau, Melanie Ganz and Desmond Elliott |
| 11:30–11:45 | MedNormJ: A Benchmark Dataset for Medical Concept Normalization in Japanese Clinical Documents Yuki Tashiro, Seiji Shimizu, Tomohiro Nishiyama, Shoko Wakamiya and Eiji ARAMAKI |
| 11:45–12:15 | Special Track on Construction or Deconstruction of LLMs |
| 11:45–12:00 | Pediatric Sepsis Cohort Detection Using In-Context Pointwise V-Usable Information Yingya Li, Alon Geva, Steven Bethard, Timothy A. Miller, Kate Madden, Matthew A. Eisenberg, Daniel P. Kelly and Guergana Savova |
| 12:00–12:15 | RECAP: Transparent Inference-Time Emotion Alignment for Medical Dialogue Systems Adarsh Srinivasan, Jacob Dineen, Muhammad Uzair Sarfraz, Muhammad Umar Afzal, Irbaz Riaz and Ben Zhou |
| 12:15–13:00 | Main Track |
| 12:15–12:30 | Disentangling Ambiguity from Instability in Large Language Models: A Clinical Text-to-SQL Case Study Angelo Ziletti and Leonardo D'Ambrosi |
| 12:30–12:45 | An OMOP-Based Open-Source Text-to-SQL Benchmark Dataset Paul Legrand, Kawsar Noor, Satyam Bhagwanani and Richard J. Dobson |
| 12:45–13:00 | Profiling Hallucinations in Frontier LLMs for Entity Linking to Medical Ontologies Logan Born, Nishant Kambhatla, Uliyana Kubasova, Maryam Siahbani, Andrei Vacariu, Timothy W. O'Connell and Anoop Sarkar |
| 13:00–13:00 | Closing |