Selected Publications
2024
- Irune Zubiaga, Aitor Soroa, Rodrigo Agerri (2024). A LLM-based Ranking Method for the Evaluation of Automatic Counter-Narrative Generation. In Findings of EMNLP 2024.
- Blanca Calvo Figueras, Rodrigo Agerri (2024). Critical Questions Generation: Motivation and Challenges. In CoNLL 2024.
- Ekaterina Sviridova, Anar Yeginbergen, Ainara Estarrona, Elena Cabrio, Serena Villata, Rodrigo Agerri (2024). CasiMedicos-Arg: A Medical Question Answering Dataset Annotated with Explanatory Argumentative Structures. In EMNLP 2024.
- Zubiaga, I., A. Soroa, and R. Agerri. 2024. Ixa at RefutES 2024: Leveraging Language Models for Counter Narrative Generation. In IberLEF (Working Notes) at SEPLN 2024. CEUR Workshop Proceedings.
- Anar Yeginbergen, Maite Oronoz, Rodrigo Agerri (2024). Argument Mining in Data Scarce Settings: Cross-lingual Transfer and Few-shot Techniques. In ACL 2024.
- Iñigo Alonso, Maite Oronoz and Rodrigo Agerri (2024). MedExpQA: Multilingual benchmarking of Large Language Models for Medical Question Answering. In Artificial Intelligence in Medicine (Elsevier).
- Iakes Goenaga, Aitziber Atutxa, Koldo Gojenola, Maite Oronoz, Rodrigo Agerri (2024). Explanatory argument extraction of correct answers in resident medical exams. In Artificial Intelligence in Medicine, 157(102985) (Elsevier).
- Olia Toporkov and Rodrigo Agerri (2024). On the Role of Morphological Information for Contextual Lemmatization. In Computational Linguistics (MIT Press). Presented at the main conference of EMNLP 2023.
- Oscar Sainz, Iker García-Ferrero, Rodrigo Agerri, Oier Lopez de Lacalle, German Rigau, Eneko Agirre, 2024. GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction. In the Twelfth International Conference on Learning Representations (ICLR 2024).
- Cristian Cardellino, Theo Collias, Benjamin Molinet, Erwan Hain, Wei Sun, Rodrigo Agerri, Serena Villata and Elena Cabrio (2024). ANTIDOTE: ArgumeNtaTIon-Driven explainable artificial intelligence fOr digiTal mEdicine. In ECAI 2024 demos.
- Masson, M., Roose, P., Sallaberry, C., Bessagnet, M.N., Le Parc Lacayrelle, A. and Agerri, R., 2024. ProxMetrics: modular proxemic similarity toolkit to generate domain-adaptable indicators from social media. Social Network Analysis and Mining, 14(1), pp.1-23.
- Anar Yeginbergen and Rodrigo Agerri (2024). Crosslingual Argument Mining in the Medical Domain. In Procesamiento del Lenguaje Natural (73).
- Rodrigo Agerri, Eneko Agirre, Gorka Azkune, Roberto Centeno, Anselmo Peñas, German Rigau, Álvaro Rodrigo, Aitor Soroa (2024). DeepKnowledge: Deep Multilingual Language Model Technology for Language Understanding. In SEPLN-CEDI-PD 2024: Seminar of the Spanish Society for Natural Language Processing: Projects and System Demonstrations, June 19-20, 2024, A Coruña, Spain.
- Rodrigo Agerri, Jeremy Barnes, Jaione Bengoetxea, Blanca Calvo Figueras, Joseba Fernandez de Landa, Iker García-Ferrero, Olia Toporkov, Irune Zubiaga (2024). HiTZ@Disargue: Few-shot Learning and Argumentation to Detect and Fight Misinformation in Social Media. In SEPLN-CEDI-PD 2024: Seminar of the Spanish Society for Natural Language Processing: Projects and System Demonstrations, June 19-20, 2024, A Coruña, Spain.
- Maxime Masson, Christian Sallaberry, Marie-Noelle Bessagnet, Annig Le Parc Lacayrelle, Philippe Roose and Rodrigo Agerri (2024). TextBI: An Interactive Dashboard for Visualizing Multidimensional NLP Annotations in Social Media Data. In Proceedings of the Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024).
- Olia Toporkov and Rodrigo Agerri (2024). Evaluating Shortest Edit Script Methods for Contextual Lemmatization. In Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024).
- Jaione Bengoetxea, Yi-Ling Chung, Marco Guerini and Rodrigo Agerri (2024). Basque and Spanish Counter Narrative Generation: Data Creation and Evaluation. In Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024).
- Iker García-Ferrero, Rodrigo Agerri, Aitziber Atutxa Salazar, Elena Cabrio, Iker de la Iglesia, Alberto Lavelli, Bernardo Magnini, Benjamin Molinet, Johana Ramirez-Romero, German Rigau, Jose Maria Villa-Gonzalez, Serena Villata and Andrea Zaninello (2024). Medical mT5: An Open-Source Multilingual Text-to-Text LLM for The Medical Domain. In Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024).
2023
- Iker García-Ferrero, Rodrigo Agerri, German Rigau (2023). T-Projection: High Quality Annotation Projection for Sequence Labeling Tasks. Findings of the Association for Computational Linguistics: EMNLP 2023.
- Rodrigo Agerri and Eneko Agirre (2023). Lessons learned from the evaluation of Spanish Language Models. Procesamiento del Lenguaje Natural (70), pp 157-170. https://doi.org/10.26342/2023-70-13.
- Gorka Urbizu, Iñaki San Vicente, Xabier Saralegi, Rodrigo Agerri, Aitor Soroa (2023). Scaling Laws for BERT in Low-Resource Settings. Findings of the Association for Computational Linguistics: ACL 2023. SCIE Class 1.
- Nayla Escribano, German Rigau, Rodrigo Agerri (2023). A modular approach for multilingual timex detection and normalization using deep learning and grammar-based methods. Knowledge-Based Systems 273. JCR: 8.139 (Q1)
- Agerri, R. et al. (2023). State-of-the-Art in Language Technology and Language-centric Artificial Intelligence. In: Rehm, G., Way, A. (eds) European Language Equality. Cognitive Technologies. Springer, Cham.
- Roberto Centeno and Rodrigo Agerri (2023). Overview of NLP-MisInfo 2023: Workshop on NLP applied to Misinformation. In Proceedings of the Workshop on NLP applied to Misinformation, co-located with the 39th International Conference of the Spanish Society for Natural Language Processing (SEPLN 2023).
- Begoña Altuna, Rodrigo Agerri, Lidia Salas-Espejo, José Javier Saiz, Alberto Lavelli, Bernardo Magnini, Manuela Speranza, Roberto Zanoli, Goutham Karunakaran (2023). Overview of TESTLINK at IberLEF 2023: Linking Results to Clinical Laboratory Tests and Measurements. Procesamiento del Lenguaje Natural (71), pp 313-320.
- Rodrigo Agerri, Iñigo Alonso, Aitziber Atutxa, Ander Berrondo, Ainara Estarrona, Iker Garcia-Ferrero, Iakes Goenaga, Koldo Gojenola, Maite Oronoz, Igor Perez-Tejedor, German Rigau and Anar Yeginbergenova (2023). HiTZ@Antidote: Argumentation-driven Explainable Artificial Intelligence for Digital Medicine. In SEPLN 2023: 39th International Conference of the Spanish Society for Natural Language Processing.
- Joseba Fernandez de Landa, Rodrigo Agerri (2023). HiTZ-IXA at PoliticES 2023: Document and Sentence Level Text Representations for Demographic Characteristics and Political Ideology Detection. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023) co-located with the Conference of the Spanish Society for Natural Language Processing (SEPLN 2023), Jaén, Spain, September 2023.
- Masson, M., Roose, P., Sallaberry, C., Agerri, R., Bessagnet, MN., Lacayrelle, A.L.P. (2023). APs: A Proxemic Framework for Social Media Interactions Modeling and Analysis. In: Crémilleux, B., Hess, S., Nijssen, S. (eds) Advances in Intelligent Data Analysis XXI. IDA 2023. Lecture Notes in Computer Science, vol 13876. Springer, Cham.
2022
Elisa Sanchez-Bayona and Rodrigo Agerri (2022). Leveraging a New Spanish Corpus for Multilingual and Crosslingual Metaphor Detection. In CoNLL 2022.
Iker García-Ferrero, Rodrigo Agerri and German Rigau (2022). Model and Data Transfer for Cross-Lingual Sequence Labelling in Zero-Resource Settings. Findings of the Association for Computational Linguistics: EMNLP 2022.
Mikel Artetxe, Itziar Aldabe, Rodrigo Agerri, Olatz Perez-de-Viñaspre, Aitor Soroa (2022). Does Corpus Quality Really Matter for Low-Resource Languages? In EMNLP 2022.
Joseba Fernandez de Landa and Rodrigo Agerri (2022). Relational Embeddings for Language Independent Stance Detection. In Arxiv.
Maxime Masson, Christian Sallaberry, Rodrigo Agerri, Marie-Noelle Bessagnet, Philippe Roose, Annig Le Parc Lacayrelle (2022). A Domain-Independent Method for Thematic Dataset Building from Social Media: The Case of Tourism on Twitter. In: Chbeir, R., Huang, H., Silvestri, F., Manolopoulos, Y., Zhang, Y. (eds) Web Information Systems Engineering - WISE 2022. WISE 2022. Lecture Notes in Computer Science, vol 13724. Springer, Cham.
Rodrigo Agerri, Roberto Centeno, María Espinosa, Joseba Fernández de Landa, Álvaro Rodrigo (2022). VaxxStance: A Dataset for Cross-Lingual Stance Detection on Vaccines. In ICWSM 2022 Data Challenge.
Jeremy Barnes, Laura Oberländer, Enrica Troiano, Andrey Kutuzov, Jan Buchmann, Rodrigo Agerri, Lilja Øvrelid, Erik Velldal (2022). SemEval 2022 Task 10: Structured Sentiment Analysis. In SemEval 2022.
Nayla Escribano, Jon Ander González, Julen Orbegozo-Terradillos, Ainara Larrondo-Ureta, Simón Peña-Fernández, Olatz Perez-de-Viñaspre and Rodrigo Agerri (2022). BasqueParl: A Bilingual Corpus of Basque Parliamentary Transcriptions. In LREC 2022.
Gorka Urbizu, Iñaki San Vicente, Xabier Saralegi, Rodrigo Agerri and Aitor Soroa (2022). BasqueGLUE: A Natural Language Understanding Benchmark for Basque. In LREC 2022.
Blanca Calvo Figueras, Montse Cuadros, Rodrigo Agerri (2022). A Semantics-Aware Approach to Automated Claim Verification. In Proceedings of the Fifth Fact Extraction and VERification Workshop (FEVER).
Nayla Escribano, Jon Ander González, Julen Orbegozo-Terradillos, Ainara Larrondo-Ureta, Simón Peña-Fernández, Olatz Pérez-de-Viñaspre, Rodrigo Agerri (2022). Euskararen erabilera Eusko Legebiltzarreko debateetan (2012-2020). In Mediatika, 19, 163-178.
2021
Iker García-Ferrero, Rodrigo Agerri and German Rigau (2021). Benchmarking Meta-embeddings: What Works and What Does Not. Findings of the Association for Computational Linguistics: EMNLP 2021.
Yi-Ling Chung, Marco Guerini and Rodrigo Agerri (2021). Multilingual Counter Narrative Type Classification. In Argument Mining 2021.
Elena Zotova, Rodrigo Agerri, German Rigau (2021). Semi-automatic generation of multilingual datasets for stance detection in Twitter. Expert Systems with Applications, 170 (2021). JCR: 5.452 (Q1). https://doi.org/10.1016/j.eswa.2020.114547
Ainhoa Serna, Aitor Soroa, Rodrigo Agerri. Applying Deep Learning Techniques for Sentiment Analysis to Assess Sustainable Transport. Sustainability. 2021; 13(4):2397. https://doi.org/10.3390/su13042397
Rodrigo Agerri, Roberto Centeno, María Espinosa, Joseba Fernández de Landa, Álvaro Rodrigo (2021). VaxxStance@IberLEF 2021: Overview of the Task on Going Beyond Text in Cross-Lingual Stance Detection. Procesamiento del Lenguaje Natural, 67, pp 173-181.
Joseba Fernandez de Landa & Rodrigo Agerri (2021): Social analysis of young Basque-speaking communities in twitter, Journal of Multilingual and Multicultural Development, DOI: 10.1080/01434632.2021.1962331. JCR: 2.814 (Q1).
Joseba Fernandez de Landa, Rodrigo Agerri (2021). Euskarazko on-line artikuluetan aipatutako izendun entitate nabarmenen identifikazioa denbora errealean. Ekaia, UPV/EHU Press. https://doi.org/10.1387/ekaia.22123.
2020
Rodrigo Agerri, German Rigau (2020). Language independent sequence labelling for Opinion Target Extraction. In IJCAI 2020.
María Espinosa, Rodrigo Agerri, Roberto Centeno, Alvaro Rodrigo (2020). DeepReading@SardiStance: Combining Textual, Social and Emotional Features. Proceedings of the Seventh Evaluation Campaign of Natural Language Processing and Speech Tools for Italian (EVALITA 2020). Winners of the SardiStance@Evalita 2020 shared task.
Rodrigo Agerri, German Rigau (2020). Projecting Heterogeneous Annotations for Named Entity Recognition. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2020). Winner of the CAPITEL@IberLEF task on Spanish NER
Rodrigo Agerri, Iñaki San Vicente, Jon Ander Campos, Ander Barrena, Xabier Saralegi, Aitor Soroa and Eneko Agirre (2020). Give your Text Representation Models some Love: the Case for Basque. In LREC 2020.
Elena Zotova, Rodrigo Agerri, Manuel Nuñez and German Rigau (2020). Multilingual Stance Detection in Tweets: The Catalonia Independence Corpus. In LREC 2020.
2019
Rodrigo Agerri, German Rigau, Language independent sequence labelling for Opinion Target Extraction. Artificial Intelligence, 268 (2019) 85-95. JCR: 6.628 (Q1). https://doi.org/10.1016/j.artint.2018.12.002
Joseba Fernandez de Landa, Rodrigo Agerri, Iñaki Alegria (2019). Large Scale Linguistic Processing of Tweets to Understand Social Interactions among Speakers of Less Resourced Languages: The Basque Case. Information 10(6): 212 (2019). https://doi.org/10.3390/info10060212
Rodrigo Agerri (2019). Doris Martin at SemEval-2019 Task 4: Hyperpartisan News Detection with Generic Semi-supervised Features. SemEval@NAACL-HLT 2019: 944-948.
Joseba Fernandez de Landa Aguirre; Rodrigo Agerri; Iñaki Alegria. Euskaldun gazte eta helduen harremanak Twitterren. III. Ikergazte. Nazioarteko ikerketa euskaraz. Kongresuko artikulu bilduma. Gizarte Zientziak eta Zuzenbidea. 2, pp. 83 - 90. Udako Euskal Unibertsitatea (UEU), 2019.
2018
Rodrigo Agerri, Yiling Chung, Itziar Aldabe, Nora Aranberri, Gorka Labaka and German Rigau (2018). Building Named Entity Recognition Taggers via Parallel Corpora. In Proceedings of the 11th Language Resources and Evaluation Conference (LREC 2018), 7-12 May, 2018, Miyazaki, Japan.
Rodrigo Agerri, Xavier Gómez Guinovart, German Rigau and Miguel Anxo Solla Portela (2018). Developing New Linguistic Resources and Tools for the Galician Language. In Proceedings of the 11th Language Resources and Evaluation Conference (LREC 2018), 7-12 May, 2018, Miyazaki, Japan.
Noelia Migueles-Abraira, Rodrigo Agerri and Arantza Diaz de Ilarraza (2018). Annotating Abstract Meaning Representations for Spanish. In Proceedings of the 11th Language Resources and Evaluation Conference (LREC 2018), 7-12 May, 2018, Miyazaki, Japan.
Rodrigo Agerri, Núria Bel, German Rigau, Horacio Saggion (2018). TUNER: Multifaceted Domain Adaptation for Advanced Textual Semantic Processing. First Results Available. Procesamiento del Lenguaje Natural 61: 163-166 (2018). http://dx.doi.org/10.26342/2018-61-23
Rodrigo Agerri, Montse Maritxalar, Verena Lyding, Lionel Nicolas (2018). enetCollect: A New European Network for combining Language Learning with Crowdsourcing Techniques. Procesamiento del Lenguaje Natural 61: 171-174 (2018). http://dx.doi.org/10.26342/2018-61-25
Rodrigo Agerri and German Rigau (2018). Simple Language Independent Sequence Labelling for the Annotation of Disabilities in Medical Texts. In Proceedings of the Workshop on Evaluation of Human Language Technologies for Iberian Languages (IberEval 2018), Diann Track, Sevilla, Spain.
2017
Egoitz Laparra, Rodrigo Agerri, Itziar Aldabe, German Rigau. Multi-lingual and Cross-lingual timeline extraction. Knowledge-Based Systems, 133, 77-89, 2017. JCR 2016: 3.325 (Q1). https://doi.org/10.1016/j.knosys.2017.07.002
Rodrigo Agerri and German Rigau. Robust Multilingual Named Entity Recognition with Shallow Semi-supervised Features. In Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence (IJCAI-17), 4965-4970, Melbourne, Australia.
Rodrigo Agerri and German Rigau. Applying Existing Named Entity Taggers at BARR IBEREVAL 2017 Task. In Proceedings of the Second Workshop on Evaluation of Human Language Technologies for Iberian Languages (IberEval 2017)
Rodrigo Agerri, Itziar Aldabe, Nora Aranberri, Yiling Chung, Gorka Labaka and German Rigau. Automatic Generation of Named Entity Recognition Taggers Using Parallel Corpora. In Proceedings of TUNER at SEPLN 2017.
2016
R. Agerri, G. Rigau, Robust multilingual Named Entity Recognition with shallow semi-supervised features. Artificial Intelligence, 238 (2016) 63-82. JCR 2015: 3.371 (Q1). http://dx.doi.org/10.1016/j.artint.2016.05.003
Piek Vossen, Rodrigo Agerri, Itziar Aldabe, Agata Cybulska, Marieke van Erp, Antske Fokkens, Egoitz Laparra, Anne-Lyse Minard, Alessio Palmero Aprosio, German Rigau, Marco Rospocher, Roxane Segers, NewsReader: Using knowledge resources in a cross-lingual reading machine to generate more knowledge from massive streams of news. Knowledge-Based Systems, 110 (2016), 60-85. JCR 2015: 3.325 (Q1). http://dx.doi.org/10.1016/j.knosys.2016.07.013
Agerri R., Aldabe I., Laparra E., Rigau G., Fokkens A., Huijgen P., van Erp M., Izquierdo R., Vossen P., Minard A. and Magnini B. Multilingual Event Detection using the NewsReader pipelines. Workshop on Cross-Platform Text Mining and Natural Language Processing Interoperability at the 10th Language Resources and Evaluation Conference (LREC’16). Portorož, Slovenia. 2016.
2015
Agerri, R., Artola X., Beloki Z., Rigau G. and Soroa A. Big Data for Natural Language Processing: A Streaming Approach. Knowledge-Based Systems. Volume 79, May 2015, Pages 36-42, ISSN 0950-7051. 2015. JCR 2014: 2.947. https://doi.org/10.1016/j.knosys.2014.11.007
I. San Vicente, X. Saralegi and R. Agerri (2015). EliXa: A modular and flexible ABSA platform. In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), Denver, USA, pp. 748-752, 2015. Winner of the Opinion Target Extraction task.
2014
Rodrigo Agerri, Josu Bermudez and German Rigau (2014): IXA pipes: Efficient and Ready to Use Multilingual NLP tools. In Proceedings of the 9th Language Resources and Evaluation Conference (LREC2014), 26-31 May, 2014, Reykjavik, Iceland.
Iñaki San Vicente, Rodrigo Agerri and German Rigau (2014): Simple, Robust and (almost) Unsupervised Generation of Polarity Lexicons for Multiple Languages. In Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL2014), April 26-30, 2014, Gothenburg, Sweden.
Rodrigo Agerri, Josu Bermudez and German Rigau (2014): Multilingual, Efficient and Easy NLP Processing with IXA Pipeline. In Proceedings of the Demo Sessions of the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL2014), April 26-30, 2014, Gothenburg, Sweden.
Agerri R., Agirre E., Aldabe I., Altuna B., Beloki Z., Laparra E., López de Lacalle M., Rigau G., Soroa A., Urizar R. The NewsReader project. Proceedings of the 30th Annual Meeting of Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN’14). Girona, Spain. Procesamiento del Lenguaje Natural. Vol. 53, pp. 215-218. ISSN: 1135-5948. 2014.
Isa Maks, Ruben Izquierdo, Francesca Frontini, Montse Cuadros, Rodrigo Agerri and Piek Vossen (2014). Generating Polarity Lexicons with WordNet propagation in 5 languages. In Proceedings of the 9th Language Resources and Evaluation Conference (LREC2014), 26-31 May, 2014, Reykjavik, Iceland.
2013-2011
Rodrigo Agerri, Montse Cuadros, Sean Gaines and German Rigau (2013). OpeNER: Open Polarity Enhanced Named Entity Recognition. Procesamiento del Lenguaje Natural, Volume 51: 215-218.
Volha Petukhova, Rodrigo Agerri, Mark Fishel, Sergio Penkale, Arantza del Pozo, Mirjam Sepesy Maucec, Andy Way, Panayota Georgakopoulou and Martin Volk (2012). SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles. In the Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2012), Istanbul, Turkey, 2012.
A.M. Wallington, R. Agerri, J.A. Barnden, M.G. Lee, T. Rumbell (2011). Affect Transfer by Metaphor for an Intelligent Conversational Agent. In K. Ahmad (ed.), Affective Computing and Sentiment Analysis: Metaphor, Ontology, Affect and Terminology. Text, Speech and Language Technology Series, pages 61-74, ISBN 978-94-007-1756-5, Springer, Heidelberg.
2010 and before
Rodrigo Agerri and Ana Garcia-Serrano (2010). Q-WordNet: Extracting polarity from WordNet senses. In the Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010), Malta, May 2010.
Rodrigo Agerri and Anselmo Peñas (2010). On the Automatic Generation of Intermediate Logic Form for WordNet glosses. 11th International Conference on Intelligent Text Processing and Computational Linguistics (Cicling-2010), Iasi, Romania, 21-27 March. LNCS Volume 6008 by Springer.
M. Volk, A. del Pozo, R. Agerri (2010). Multilingual subtitling in the age of Google Translate. Language and the Media Conference, Berlin, 6-8 October, 2010.
Rodrigo Agerri (2008). Metaphor in Textual Entailment. Companion Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008), Manchester, UK, pages 2-6.
R. Agerri, J.A. Barnden, M.G. Lee and A.M. Wallington (2007). Metaphor, Inference and Domain Independent Mappings. Proceedings of the Research Advances in Natural Language Processing Conference (RANLP-07), Borovets, Bulgaria, 27-29 September 2007.
Book translation
John Perry, Referencialismo crítico. La teoría reflexivo-referencial del significado. Stanford: CSLI Publications, 2006. Translated by Kepa Korta and Rodrigo Agerri.