Text Mining and Retrieval Leiden

People

Group members

Suzan Verberne,
Assistant Professor at the Leiden Institute of Advanced Computer Science (LIACS)
Group leader
Alex Brandsen,
PhD student in the Faculty of Archaeology
Big data in archaeology: harnessing the hidden knowledge in the “graveyard” of Malta reports
Anne Dirkson,
PhD student at the Leiden Institute of Advanced Computer Science (LIACS)
Knowledge Discovery and Data Mining from patient experience repositories
Arian Askari,
PhD student at the Leiden Institute of Advanced Computer Science (LIACS)
Transparent User Profiling for Professional Search
Gineke Wiggers,
PhD student in the Faculty of Law
Measuring relevance and relations of Dutch legal publications
Hugo de Vos,
PhD student in the Faculty of Governance and Global Affairs (FGGA)
Automated text analysis of policy-related documentation
Juan Bascur Cifuentes,
PhD student at the Centre for Science and Technology Studies (CWTS)
Interactive visual browsing and retrieval of scientific literature
Martin Kroon,
PhD student in the Faculty of Humanities
Detecting cross-linguistic syntactic differences automatically
Wout Lamers,
PhD student at the Centre for Science and Technology Studies (CWTS)
Understanding scientific progress by analysing the context of scholarly citations
Xue Wang,
Visiting PhD student from Xi'an Jiaotong University
Unified Knowledge Base learning from text and images

Affiliated researchers

  • Anne-Fleur van Luenen, PhD student at the University of Amsterdam
  • Ben Companjen, Digital Scholarship Librarian at Leiden University Library
  • Benjamin van der Burgh, self-employed and guest researcher at LIACS
  • Bram van Dijk, PhD student at LIACS (Media Technology)
  • Jelena Prokic, Assistant Professor at the Leiden University Centre for Digital Humanities
  • Lian Yuchen, PhD candidate at LIACS (Media Technology)
  • Lauren Fonteyn, Assistant Professor at LUCL
  • Marc van Oudheusden, external PhD student at the Leiden Institute for Area Studies
  • Marieke van Buchem, PhD student at LUMC
  • Michiel van der Meer, PhD student at LIACS (Hybrid Intelligence)
  • Moritz Müller, PhD candidate at the Faculty of Governance and Global Affairs
  • Myrthe Reuver, PhD candidate at the VU Amsterdam
  • Sophia Althammer, PhD student at the TU Wien
  • Peter Verhaar, Digital Scholarship Librarian at Leiden University Library
  • Shima Javanmardi, PhD student at LIACS (in the group of Fons Verbeek)
  • Sophie Koning, PhD student in the Faculty of Law
  • Wessel Kraaij, Professor at LIACS

Current master student projects

| Student name | Year | Topic | Co-supervisor |
|---|---|---|---|
| Jiakun Sun | 2020 | AutoNER from automatically transcribed political meetings | Hugo de Vos |
| Ricardo Michels | 2020 | Spread of Covid-19 misinformation in social media | Tim A Majchrzak, Uni of Agder |
| Yanfang Hou | 2020 | Spread of Covid-19 misinformation in social media | Tim A Majchrzak, Uni of Agder |
| Zhenyu Guo | 2020 | When can BERT beat the baseline in text classification | |
| Tristan Kattenberg | 2020 | Text Generation For Clinical Records | |
| Keyan Jin | 2020 | Text mining for financial forecasting | Jian Wang |
| Julius Cathalina | 2020 | Preparing the "US Food and Drug Administration Adverse Event Reporting System" (FAERS) dataset | Coen van Hasselt, LACDR |
| Marcus Abukari | 2020 | Predictive clinical models based on BERT | Hine van Os, LUMC |
| Mohamed Barbouch | 2020 | Integration of BERT and WordNet for improving Natural Language Understanding | |
| Martin Koole | 2020 | The automatic labelling of medical entities in Dutch endoscopy reports | Aisha Sie, DearHealth |
| Chen Wang | 2020 | The automatic labelling of medical entities in Dutch endoscopy reports | Anne Dirkson |
| Abdullah Alsaubie | 2020 | Few-shot medical concept normalization for ADR | Anne Dirkson |

Former master student projects

| Student name | Year | Topic | Co-supervisor |
|---|---|---|---|
| Ken Voskuil | 2020 | Extracting scientific citations from patent texts | Jian Wang |
| Francesco Bovo | 2020 | Clinical Temporal Relation Extraction: Towards a Patient’s Timeline Creation | Veysel Kocaman |
| Rayan Suryadikara | 2020 | Political hate speech and misinformation detection from Indonesian social media | Frank Takes |
| Shutong Zeng | 2020 | Systematic review classification | Chava Ramspek, LUMC |
| Ioannis Chios | 2020 | Explainable Information Retrieval | |
| Martin Koole | 2019 | Multi-label classification of archaeological excavation reports | Alex Brandsen |
| Chen Wang | 2019 | Relation extraction from biomedical publications | Anne Dirkson |
| Abdulah Alsaubie | 2019 | Medical sentiment analysis | Anne Dirkson |
| Anneloes Louwe | 2019 | Hacking Women’s Stroke – Exploring the free text in patients’ files | Hine van Os, LUMC |
| Yuting Hu | 2019 | Biomedical Entity Recognition in Chinese patent documents | dr. Magnus Palmblad, LUMC |
| Ioannis Chios | 2019 | Citation extraction from patent texts using the Flair framework | dr. Jian Wang |
| Xiaoling Zhang | 2019 | Text mining on Biorefinery Published Articles | dr. Bernhard Steubing, CML |
| Natalia Bukarina | 2019 | Ranking candidates to job descriptions | YoungCapital |
| Wilco Draijer | 2018 | Case law retrieval | Ortec |
| Soeradj Kanhai | 2018 | Reach and content of Dutch clickbait on Facebook | dr. Peter Burger |
| Thomas Prikkel | 2018 | Reducing manual labor in Technology-Assisted Review | Prof. Johannes C. Scholtes, Zylab |
| Renuka Ramgolam | 2018 | Visualization of privacy-sensitive data in local governments | Motion10 |

Former group members

  • Arianna Bisazza
  • Paul Vierthaler
  • Prajit Dhar