Synthetic vs. Real Reference Strings for Citation Parsing, and the Importance of Re-training and Out-Of-Sample Data for Meaningful Evaluations: Experiments with GROBID, GIANT and CORA [pre-print]

Abstract. Citation parsing, particularly with deep neural networks, suffers from a lack of training data, as available datasets typically contain only a few thousand training instances. Manually labelling citation strings is very time-consuming, hence synthetically created training data could be Read more…

Edward Bergman joins our group as a D-REAL PhD student for research on automated algorithm selection in Information Retrieval (AutoIR), Recommender Systems (AutoRecSys) and Machine Learning (AutoML)

We welcome Edward Bergman as a new full-time PhD student here at the School of Computer Science and Statistics in Trinity College Dublin, funded through the new D-REAL SFI Centre for Research Training (CRT) and supported by the ADAPT Centre. Read more…

Multi-stream Data Analytics for Enhanced Performance Prediction in Fantasy Football [Pre-Print]

Abstract. Fantasy Premier League (FPL) performance predictors tend to base their algorithms purely on historical statistical data. The main problem with this approach is that external factors such as injuries, managerial decisions and other tournament match statistics can never Read more…

Open Position at Trinity College Dublin: Full Professorship in Computer Science / Artificial Intelligence, Machine Learning, NLP, DKE … (€117k — €151k/pa)

After recently advertising an Assistant Professorship in Computer Science (Artificial Intelligence), Trinity College Dublin now has an additional position to fill: a Full Professor in Computer Science focusing on Artificial Intelligence and related disciplines (machine learning, natural language processing, Read more…