Synthetic vs. Real Reference Strings for Citation Parsing, and the Importance of Re-training and Out-Of-Sample Data for Meaningful Evaluations: Experiments with GROBID, GIANT and CORA [pre-print]

ABSTRACT Citation parsing, particularly with deep neural networks, suffers from a lack of training data, as available datasets typically contain only a few thousand training instances. Manually labelling citation strings is very time-consuming; hence, synthetically created training data could be Read more…

Open Position at Trinity College Dublin: Full Professorship in Computer Science / Artificial Intelligence, Machine Learning, NLP, DKE … (€117k–€151k/pa)

After recently advertising an Assistant Professorship in Computer Science (Artificial Intelligence), Trinity College Dublin now has an additional position to fill: a Full Professorship in Computer Science focusing on Artificial Intelligence and related disciplines (machine learning, natural language processing, Read more…