NLM DIR Seminar Schedule
UPCOMING SEMINARS
-
Jan. 22, 2026 Mario Flores
AI Pipeline for Characterization of the Tumor Microenvironment -
Jan. 27, 2026 Zhaohui Liang
Heterogeneous Graph Re-ranking for CLIP-based Medical Cross-modal Retrieval -
Jan. 29, 2026 Mehdi Bagheri Hamaneh
FastSpel: A simple peptide spectrum predictor that achieves deep learning-level performance at a fraction of the computational cost -
Feb. 3, 2026 Matthew Diller
TBD -
Feb. 5, 2026 Lana Yeganova
From Algorithms to Insights: Bridging AI and Topic Discovery for Large-Scale Biomedical Literature Analysis.
RECENT SEMINARS
-
Jan. 20, 2026 Anastasia Gulyaeva
Diversity and evolution of the ribovirus class Stelpaviricetes -
Jan. 8, 2026 Won Gyu Kim
LitSense 2.0: AI-powered biomedical information retrieval with sentence and passage level knowledge discovery -
Dec. 16, 2025 Sarvesh Soni
ArchEHR-QA: A Dataset and Shared Task for Grounded Question Answering from Electronic Health Records -
Dec. 2, 2025 Qingqing Zhu
CT-Bench & CARE-CT: Building Reliable Multimodal AI for Lesion Analysis in Computed Tomography -
Nov. 25, 2025 Jing Wang
MIMIC-EXT-TE: Millions Clinical Temporal Event Time-Series Dataset
Scheduled Seminars on Nov. 25, 2025
Contact NLMDIRSeminarScheduling@mail.nih.gov with questions about this seminar.
Abstract:
Early detection and timely treatment are critical in medicine. For example, surgical excision of skin lesions can cure early-stage skin cancer, but once metastasis occurs, even the most advanced therapies often fail. In this work, we introduce MIMIC-EXT-TE, a large-scale dataset provides a structured timeline of over a million clinical events from MIMIC-IV-Note. It is the first dataset with temporal information of events in patient level. To achieve the dataset, we propose to integrate retrieval-augmented generation with large language models to capture the temporal trajectories of patient events. To evaluate the dataset, we introduce TimeLife, a temporal-aware medical question answering system by fine-tuning the Qwen3-4B-Base language model on our dataset. TimeLife achieves an 18% overall accuracy boost on MedMCQA dataset compared with the base model. By fine-tuning TimeLife with downstream tasks such as PubMedQA and MedMCQA, TimeLife achieves the superiority most of the time compared with fine-tuning only on the base model without our dataset.