NLM IRP Seminar Schedule
UPCOMING SEMINARS
- May 7, 2024 OPEN
  TBD
- May 9, 2024 Pascal Mutz
  The Riboviria protein structurome expands virus protein annotation and highlights protein relations
- May 14, 2024 Stanley Liang
  TBD
- May 16, 2024 Diego Salazar
  TBD
- May 21, 2024 Ziynet Kesimoglu
  TBD
RECENT SEMINARS
- May 2, 2024 OPEN
  TBD
- April 30, 2024 Wenya Rowe
  The conformal central charge of the spin-1/2 XX model derived from long-chain asymptotics
- April 25, 2024 Ermin Hodzic
  Condition-Aware Cell Type Deconvolution of Bulk Tissues
- April 16, 2024 Jaya Srivastava
  Regulatory plasticity of the human genome
- April 11, 2024 Sergey Shmakov
  Comprehensive survey of the TnpB RNA-guided nucleases
Scheduled Seminars on Jan. 20, 2022
Contact NLM_IRP_Seminar_Scheduling@mail.nih.gov with questions about this seminar.
Abstract:
Since a genome is essentially a document written in the alphabet of nucleotides, the field of Computational Biology has been informed by Natural Language Processing techniques since its inception. In this talk I will describe how "MinHash", a relatively obscure algorithm developed for searching the web, has been transformative for the task of genomic similarity estimation. I will go into how and why the algorithm works for sequences of nucleotides and amino acids rather than natural language documents, and I will discuss the creation and validation of tools employing the algorithm, variations for different kinds of searches, and the range of applications it can help with.
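As a rough illustration of the idea the abstract describes, the sketch below applies a bottom-s variant of MinHash to the k-mers of two short nucleotide strings and compares the estimated Jaccard similarity with the exact value. It is a minimal sketch under stated assumptions, not the speaker's method or any particular tool's implementation: the function names, the SHA-1-based hash, the k-mer length, and the sketch size are all illustrative choices that do not come from the abstract.

```python
import hashlib


def kmers(seq, k):
    """Return the set of all length-k substrings (k-mers) of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def hash_kmer(kmer):
    """Map a k-mer to a 64-bit integer using a stable hash.

    SHA-1 truncated to 8 bytes is an illustrative assumption; any
    well-mixed hash function would serve the same purpose here.
    """
    digest = hashlib.sha1(kmer.encode("ascii")).digest()
    return int.from_bytes(digest[:8], "big")


def sketch(seq, k=5, s=64):
    """Bottom-s MinHash sketch: the s smallest distinct k-mer hashes, sorted."""
    return sorted({hash_kmer(km) for km in kmers(seq, k)})[:s]


def jaccard_exact(a, b, k=5):
    """Exact Jaccard similarity of the two k-mer sets (for comparison)."""
    ka, kb = kmers(a, k), kmers(b, k)
    union = ka | kb
    return len(ka & kb) / len(union) if union else 0.0


def jaccard_estimate(sketch_a, sketch_b, s=64):
    """Estimate Jaccard similarity from two bottom-s sketches.

    Take the s smallest hashes of the merged sketches (a bottom-s sample
    of the union) and count how many of them occur in both sketches.
    """
    set_a, set_b = set(sketch_a), set(sketch_b)
    merged = sorted(set_a | set_b)[:s]
    if not merged:
        return 0.0
    shared = sum(1 for h in merged if h in set_a and h in set_b)
    return shared / len(merged)


if __name__ == "__main__":
    a = "ACGTACGTGGCA"
    b = "ACGTACGTTTCA"
    # With a sketch larger than the number of distinct k-mers, the
    # estimate coincides with the exact value (barring hash collisions).
    print(jaccard_exact(a, b, k=3))
    print(jaccard_estimate(sketch(a, k=3, s=1000), sketch(b, k=3, s=1000), s=1000))
```

The point of the sketch is that each sequence is reduced to a small, fixed-size list of hashes, so two genomes can be compared without revisiting the full sequences. For simplicity this version treats the two strands independently; production tools typically make further choices, such as canonical k-mers that treat a sequence and its reverse complement alike, which are omitted here.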