Text Mining and Retrieval Leiden

People

Group members

Suzan Verberne,
Associate Professor at the Leiden Institute of Advanced Computer Science (LIACS)
Group leader
Alex Brandsen,
Postdoc in the faculty of Archeology/LIACS
EXALT: Excavating Archaeological Literature
Amin Abolghasemi,
PhD student at the Leiden Institute of Advanced Computer Science (LIACS)
Graph Browsing for Professional Search
Arian Askari,
PhD student at the Leiden Institute of Advanced Computer Science (LIACS)
Transparent User Profiling for Professional Search
Gineke Wiggers,
PhD student in the faculty of Law
The Relevance of Impact: bibliometric-enhanced legal information retrieval
Hugo de Vos,
PhD student in the faculty of Governance and Global affairs (FGGA)
Automated analysis of political documents and processes
Juan Bascur Cifuentes,
PhD student at the Centre for Science and Technology Studies (CWTS)
Interactive visual browsing and retrieval of scientific literature
Keyan Jin,
PhD student LIACS/Science-based business
Linking patents to scientific publications through in-text reference
Myrthe Reuver,
PhD candidate at the VU Amsterdam
Diversity in news recommender systems
Sophia Althammer,
PhD student at the TU Wien
Information in Production
Zahra Abbasiyantaeb,
Research assistant LIACS/Science-based business
Linking patents to scientific publications through in-text reference mining

PhD graduates from the group

Anne Dirkson,
PhD at Leiden Institute of Advanced Computer Science (LIACS)
Knowledge Discovery from Patient Forums
Xue Wang,
joint PhD with Xi'an Jiaotong University
Unified Knowledge Base learning from text and images

Affiliated researchers

  • Ben Companjen, Digital Scholarship Librarian at Leiden University Library
  • Bram van Dijk, PhD student at LIACS (Media Technology)
  • Jelena Prokic, Assistant Professor at the Leiden University Centre for Digital Humanities
  • Kalliopi Zervanou, Assistant Professor at LUMC and LIACS
  • Lauren Fonteyn, Assistant Professor at LUCL
  • Marc van Oudheusden, external PhD student at the Leiden Institute for Area Studies
  • Marieke van Buchem, PhD student at LUMC
  • Michiel van der Meer, PhD student at LIACS (Hybrid Intelligence)
  • Shima Javanmardi, PhD student at LIACS (in the group of Fons Verbeek)
  • Wessel Kraaij, Professor at LIACS
  • Wout Lamers, PhD student at the Centre for Science and Technology Studies (CWTS)

Former group members and guests

  • Anne Dirkson
  • Benjamin van der Burgh
  • Arianna Bisazza
  • Martin Kroon
  • Paul Vierthaler
  • Prajit Dhar
  • Wout Lamers
  • Xue Wang

Current master student projects

Student nameYearTopicCo-supervisor
Eirini Kousathana2022Opinionated content summarization
Melle van der Meulen2022Automatic classification of environmental product foodprintsJulian Karch
Luuk Nolden 2021A model for automatic scansion of Latin textMatthew Payne
Murad H. Bozik2021Analyzing open-ended questions in hospital questionnairesIlse Kant
Xiao Zhang2021Stance detectionJohan Bos (Groningen)
Ricardo Michels2020Spread of Covid-19 misinformation in social media Frank Takes
Tristan Kattenberg2020Text Generation For Clinical Records

Former master student projects

Student nameYearTopicCo-supervisor
Stratos Triantafyllou 2021Information extraction for complex entitiesKarl Aberer / Rémi Lebret, EPFL
Yanfang Hou2020Stance detection for Covid-19 misinformation in social media
Martin Koole2020The automatic labelling of medical entities in Dutch endoscopy reportsAisha Sie, DearHealth
Julius Cathalina2020Preparing the "US Food and Drug Administration Adverse Event Reporting System" (FAERS) datasetCoen van Hasselt, LACDR
Chen Wang2021Using ensemble models with structural information in social media to aid rumour stance classificationAnne Dirkson
Keyan Jin2021Predicting Stock Price Movement with Multi-source InformationJian Wang
Zhenyu Guo2021When can BERT beat SVM? Replication of four text classification papers and fine-tuning RobBERT on Dutch language datasetsPeter van der Putten
Jiakun Sun2021Automatic Named Entity Recognition for ASR output in the Political DomainHugo de Vos
Marcus Abukari2020Predictive clinical models based on BERTHine van Os, LUMC
Mohamed Barbouch2020Integration of BERT and WordNet for improving Natural Language UnderstandingTessa Verhoef
Ken Voskuil2020Extracting scientific citations from patent textsJian Wang
Francesco Bovo2020Clinical Temporal Relation Extraction: Towards aPatient’s Timeline CreationVeysel Kocaman
Rayan Suryadikara2020Political hate speech and misinformation detection from Indonesian social mediaFrank Takes
Shutong Zeng2020Systematic review classificationChava Ramspek, LUMC
Ioannis Chios2020Explainable Information RetrievalStephan Raaijmakers, LUCL
Martin Koole2019Multi-label classification of archaeological excavation reportsAlex Brandsen
Anneloes Louwe2019Hacking Women’s Stroke – Exploring the free text in patients’ filesHine van Os, LUMC
Yuting Hu2019Biomedical Entity Recognition in Chinese patent documentsMagnus Palmblad, LUMC
Ioannis Chios2019Citation extraction from patent texts using the Flair frameworkJian Wang
Xiaoling Zhang2019Text mining on Biorefinery Published ArticlesBernhard Steubing, CML
Natalia Bukarina2019Ranking candidates to job descriptions YoungCapital
Wilco Draijer2018Case law retrieval Ortec
Soeradj Kanhai2018Reach and content of Dutch clickbait on Facebookdr. Peter Burger
Thomas Prikkel 2018Reducing manual labor in Technology-Assisted Review Prof. Johannes C. Scholtes, Zylab
Renuka Ramgolam2018Visualization of privacy-sensitive data in local governments Motion10