Research on LLMs, generation, and grounding

2026

LLM

What Does LLM Refinement Actually Improve? A Systematic Study on Document-Level Literary Translation

Shaomu Tan, Dawei Zhu, Ke Tran, Michael Denkowski, Sony Trenous, Bill Byrne, Leonardo F. R. Ribeiro, et al.

ACL 2026

Agent

SkillLearnBench: Benchmarking Continual Learning Methods for Agent Skill Generation on Real-World Tasks

Shanshan Zhong, Yi Lu, Jingjie Ning, Yibing Wan, Lihan Feng, Yuyi Ao, Leonardo F. R. Ribeiro, Markus Dreyer, Sean Ammirati, Chenyan Xiong

arXiv, 2026

VLM

Benchmarking Deflection and Hallucination in Large Vision-Language Models

Nicholas Moratelli, Christopher Davis, Leonardo F. R. Ribeiro, Bill Byrne, Gonzalo Iglesias

ACL 2026

Eval

DeepFact: Co-Evolving Benchmarks and Agents for Deep Research Factuality

Yukun Huang, Leonardo F. R. Ribeiro, Momchil Hardalov, Bhuwan Dhingra, Markus Dreyer, Venkatesh Saligrama

ACL 2026

Safe

RefusalBench: Generative Evaluation of Selective Refusal in Grounded Language Models

Aashiq Muhamed, Leonardo F. R. Ribeiro, Markus Dreyer, Virginia Smith, Mona T. Diab

EACL 2026

Patent

Evaluating Retrieval System for Language Model Processing

Nicolaas Jedema, Leonardo F. R. Ribeiro, Alessandro Moschitti, Matteo Gabburo, Siddhant Garg

US Patent US12,579,174

2025

RAG

RAGferee: Building Contextual Reward Models for Retrieval-Augmented Generation

Andrei C. Coman, Ionut-Teodor Sorodoc, Leonardo F. R. Ribeiro, Bill Byrne, James Henderson, Adrià de Gispert

EMNLP 2025

Agent

Deep Research Comparator: A Platform for Fine-Grained Human Annotations of Deep Research Agents

Prahaladh Chandrahasan, Jiahe Jin, Zhihan Zhang, Tevin Wang, Andy Tang, Lucy Mo, Morteza Ziyadi, Leonardo F. R. Ribeiro, Zimeng Qiu, Markus Dreyer, Akari Asai, Chenyan Xiong

WebConf 2026 Demo

RAG

GaRAGe: A Benchmark with Grounding Annotations for RAG Evaluation

Ionut-Teodor Sorodoc, Leonardo F. R. Ribeiro, Rexhina Blloshmi, Christopher Davis, Adrià de Gispert

ACL 2025 Findings

RAG

XRAG: Cross-Lingual Retrieval-Augmented Generation

Wei Liu, Sony Trenous, Leonardo F. R. Ribeiro, Bill Byrne, Felix Hieber

EMNLP 2025 Findings

QA

NeoQA: Evidence-Based Question Answering with Generated News Events

Max Glockner, Xiang Jiang, Leonardo F. R. Ribeiro, Iryna Gurevych, Markus Dreyer

ACL 2025 Findings

Model

The Amazon Nova Family of Models: Technical Report and Model Card

Amazon AGI, Amazon Artificial General Intelligence, et al.

arXiv, 2025

Align

Tuning-Free Personalized Alignment via Trial-Error-Explain In-Context Learning

Hyundong Cho, Karishma Sharma, Nicolaas Jedema, Leonardo F. R. Ribeiro, Alessandro Moschitti, Ravi Krishnan, Jonathan May

NAACL 2025 Findings

Model

Amazon Nova 2: Multimodal Reasoning and Generation Models

Amazon AGI

Technical Report, 2025

Model

Amazon Nova Premier: Technical Report and Model Card

Amazon AGI

Technical Report, 2025

2024

RAG

Learning When to Retrieve, What to Rewrite, and How to Respond in Conversational QA

Nirmal Roy, Leonardo F. R. Ribeiro, Rexhina Blloshmi, Kevin Small

EMNLP 2024 Findings

LLM

Speechworthy Instruction-Tuned Language Models

Hyundong Cho, Nicolaas Jedema, Leonardo F. R. Ribeiro, Karishma Sharma, Pedro Szekely, Alessandro Moschitti, Ruben Janssen, Jonathan May

EMNLP 2024

Tool

FANTAstic SEquences and Where to Find Them: Faithful and Efficient API Call Generation through State-Tracked Constrained Decoding and Reranking

Zhuoer Wang, Leonardo F. R. Ribeiro, Alexandros Papangelis, Rohan Mukherjee, Tzu-Yen Wang, Xinyan Zhao, Arijit Biswas, James Caverlee, Angeliki Metallinou

EMNLP 2024 Findings

IR

Measuring Retrieval Complexity in Question Answering Systems

Matteo Gabburo, Nicolaas Paul Jedema, Siddhant Garg, Leonardo F. R. Ribeiro, Alessandro Moschitti

ACL 2024 Findings

Eval

On the Role of Summary Content Units in Text Summarization Evaluation

Marcel Nawrath, Agnieszka Wiktoria Nowak, Tristan Ratz, Danilo Constantin Walenta, Juri Opitz, Leonardo F. R. Ribeiro, Joao Sedoc, et al.

NAACL 2024

MM

REFINESUMM: Self-Refining MLLM for Generating a Multimodal Summarization Dataset

Vaidehi Patil, Leonardo F. R. Ribeiro, Mengwen Liu, Mohit Bansal, Markus Dreyer

ACL 2024

Edit

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 5: Tutorial Abstracts)

Luis Chiruzzo, Hung-yi Lee, Leonardo F. R. Ribeiro, editors

ACL 2024 Tutorial Abstracts

2023

NLG

Generating Summaries with Controllable Readability Levels

Leonardo F. R. Ribeiro, Mohit Bansal, Markus Dreyer

EMNLP 2023

Robotics

Learning to Reason over Scene Graphs: A Case Study of Finetuning GPT-2 into a Robot Language Model for Grounded Task Planning

Georgia Chalvatzaki, Ali Younes, Daljeet Nandha, An Thai Le, Leonardo F. R. Ribeiro, Iryna Gurevych

Frontiers in Robotics and AI, 2023

2022

IR

Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking

Tim Baumgärtner, Leonardo F. R. Ribeiro, Nils Reimers, Iryna Gurevych

EMNLP 2022

QA

UKP-SQuARE v2: Explainability and Adversarial Attacks for Trustworthy QA

Rachneet Sachdeva, Haritz Puerto, Tim Baumgärtner, Sewin Tariverdian, Hao Zhang, Kexin Wang, Hossain Shaikh Saadi, Leonardo F. R. Ribeiro, Iryna Gurevych

AACL 2022 Demo

NLG

GEMv2: Multilingual NLG Benchmarking in a Single Line of Code

Sebastian Gehrmann, Abhik Bhattacharjee, Abinaya Mahendiran, Alex Wang, Alexandros Papangelis, et al.

EMNLP 2022 Demo

Eval

FactGraph: Evaluating Factuality in Summarization with Semantic Graph Representations

Leonardo F. R. Ribeiro, Mengwen Liu, Iryna Gurevych, Markus Dreyer, Mohit Bansal

NAACL 2022

QA

UKP-SQUARE: An Online Platform for Question Answering Research

Tim Baumgärtner, Kexin Wang, Rachneet Sachdeva, Max Eichler, Gregor Geigle, Clifton Poth, Hannah Sterz, Haritz Puerto, Leonardo F. R. Ribeiro, et al.

ACL 2022 Demo

Thesis

Graph-Based Approaches to Text Generation

Leonardo F. R. Ribeiro

Ph.D. thesis, Technical University of Darmstadt, 2022

2021

Graph

A Neural Graph-Based Local Coherence Model

Mohsen Mesgar, Leonardo F. R. Ribeiro, Iryna Gurevych

EMNLP 2021 Findings

AMR

Smelting Gold and Silver for Improved Multilingual AMR-to-Text Generation

Leonardo F. R. Ribeiro, Jonas Pfeiffer, Yue Zhang, Iryna Gurevych

EMNLP 2021

AMR

Structural Adapters in Pretrained Language Models for AMR-to-Text Generation

Leonardo F. R. Ribeiro, Yue Zhang, Iryna Gurevych

EMNLP 2021

NLG

Investigating Pretrained Language Models for Graph-to-Text Generation

Leonardo F. R. Ribeiro, Martin Schmitt, Hinrich Schütze, Iryna Gurevych

NLP4ConvAI, EMNLP 2021

NLG

Modeling Graph Structure via Relative Position for Better Text Generation from Knowledge Graphs

Martin Schmitt, Leonardo F. R. Ribeiro, Philipp Dufter, Iryna Gurevych, Hinrich Schütze

TextGraphs, NAACL 2021

2020

NLP

Common Sense or World Knowledge? Investigating Adapter-Based Knowledge Injection into Pretrained Transformers

Anne Lauscher, Olga Majewska, Leonardo F. R. Ribeiro, Iryna Gurevych, Nikolai Rozanov, Goran Glavaš

DeeLIO, EMNLP 2020

NLG

Metaphoric Paraphrase Generation

Kevin Stowe, Leonardo F. R. Ribeiro, Iryna Gurevych

arXiv, 2020

NLG

Modeling Global and Local Node Contexts for Text Generation from Knowledge Graphs

Leonardo F. R. Ribeiro, Yue Zhang, Claire Gardent, Iryna Gurevych

TACL 2020

2019

AMR

Enhancing AMR-to-Text Generation with Dual Graph Representations

Leonardo F. R. Ribeiro, Claire Gardent, Iryna Gurevych

EMNLP 2019

Law

Análise e Ranqueamento da Rede de Advogados induzida por Processos Judiciais Trabalhistas

Leonardo F. R. Ribeiro, Daniel R. Figueiredo, Pedro R. Nascimento

BraSNAM 2019

Eval

Ranking Generated Summaries by Correctness: An Interesting but Challenging Application for Natural Language Inference

Tobias Falke, Leonardo F. R. Ribeiro, Prasetya Ajie Utama, Ido Dagan, Iryna Gurevych

ACL 2019

2018 and Earlier

Game

Performance of Monte Carlo Tree Search Algorithms when Playing the Game Ataxx

Leonardo F. R. Ribeiro, Daniel R. Figueiredo

ENIAC 2018

Graph

struc2vec: Learning Node Representations from Structural Identity

Leonardo F. R. Ribeiro, Pedro H. P. Saverese, Daniel R. Figueiredo

KDD 2017

Law

Ranking Lawyers Using a Social Network Induced by Legal Cases

Leonardo F. R. Ribeiro, Daniel R. Figueiredo

Journal of the Brazilian Computer Society, 2017

Code

FlowTracker: Detecção de Código Não Isócrono via Análise Estática de Fluxo

Bruno R. Silva, Leonardo F. R. Ribeiro, Diego Aranha, Fernando M. Q. Pereira

CBSOFT 2015