Research on LLMs, generation, and grounding

Google Scholar

2026

LLM

What Does LLM Refinement Actually Improve? A Systematic Study on Document-Level Literary Translation

Shaomu Tan, Dawei Zhu, Ke Tran, Michael Denkowski, Sony Trenous, Bill Byrne, Leonardo F. R. Ribeiro, et al.

ACL 2026

PDF

Agent

SkillLearnBench: Benchmarking Continual Learning Methods for Agent Skill Generation on Real-World Tasks

Shanshan Zhong, Yi Lu, Jingjie Ning, Yibing Wan, Lihan Feng, Yuyi Ao, Leonardo F. R. Ribeiro, Markus Dreyer, Sean Ammirati, Chenyan Xiong

arXiv, 2026

PDF Project Code

VLM

Benchmarking Deflection and Hallucination in Large Vision-Language Models

Nicholas Moratelli, Christopher Davis, Leonardo F. R. Ribeiro, Bill Byrne, Gonzalo Iglesias

ACL 2026

PDF

Eval

DeepFact: Co-Evolving Benchmarks and Agents for Deep Research Factuality

Yukun Huang, Leonardo F. R. Ribeiro, Momchil Hardalov, Bhuwan Dhingra, Markus Dreyer, Venkatesh Saligrama

ACL 2026

PDF Code

Safe

RefusalBench: Generative Evaluation of Selective Refusal in Grounded Language Models

Aashiq Muhamed, Leonardo F. R. Ribeiro, Markus Dreyer, Virginia Smith, Mona T. Diab

EACL 2026

PDF

Patent

Evaluating Retrieval System for Language Model Processing

Nicolaas Jedema, Leonardo F. R. Ribeiro, Alessandro Moschitti, Matteo Gabburo, Siddhant Garg

US Patent US12,579,174

Patent

2025

RAG

RAGferee: Building Contextual Reward Models for Retrieval-Augmented Generation

Andrei C. Coman, Ionut-Teodor Sorodoc, Leonardo F. R. Ribeiro, Bill Byrne, James Henderson, Adrià de Gispert

EMNLP 2025

PDF Code

Agent

Deep Research Comparator: A Platform for Fine-Grained Human Annotations of Deep Research Agents

Prahaladh Chandrahasan, Jiahe Jin, Zhihan Zhang, Tevin Wang, Andy Tang, Lucy Mo, Morteza Ziyadi, Leonardo F. R. Ribeiro, Zimeng Qiu, Markus Dreyer, Akari Asai, Chenyan Xiong

WebConf 2026 Demo

PDF Code

RAG

GaRAGe: A Benchmark with Grounding Annotations for RAG Evaluation

Ionut-Teodor Sorodoc, Leonardo F. R. Ribeiro, Rexhina Blloshmi, Christopher Davis, Adrià de Gispert

ACL 2025 Findings

PDF Code

RAG

XRAG: Cross-Lingual Retrieval-Augmented Generation

Wei Liu, Sony Trenous, Leonardo F. R. Ribeiro, Bill Byrne, Felix Hieber

EMNLP 2025 Findings

PDF

NeoQA: Evidence-Based Question Answering with Generated News Events

Max Glockner, Xiang Jiang, Leonardo F. R. Ribeiro, Iryna Gurevych, Markus Dreyer

ACL 2025 Findings

PDF Code

Model

The Amazon Nova Family of Models: Technical Report and Model Card

Amazon AGI, Amazon Artificial General Intelligence, et al.

arXiv, 2025

PDF Report

Align

Tuning-Free Personalized Alignment via Trial-Error-Explain In-Context Learning

Hyundong Cho, Karishma Sharma, Nicolaas Jedema, Leonardo F. R. Ribeiro, Alessandro Moschitti, Ravi Krishnan, Jonathan May

NAACL 2025 Findings

PDF

Model

Amazon Nova 2: Multimodal Reasoning and Generation Models

Amazon AGI

Technical Report, 2025

Report

Model

Amazon Nova Premier: Technical Report and Model Card

Amazon AGI

Technical Report, 2025

Report PDF

2024

RAG

Learning When to Retrieve, What to Rewrite, and How to Respond in Conversational QA

Nirmal Roy, Leonardo F. R. Ribeiro, Rexhina Blloshmi, Kevin Small

EMNLP 2024 Findings

PDF

LLM

Speechworthy Instruction-Tuned Language Models

Hyundong Cho, Nicolaas Jedema, Leonardo F. R. Ribeiro, Karishma Sharma, Pedro Szekely, Alessandro Moschitti, Ruben Janssen, Jonathan May

EMNLP 2024

PDF

Tool

FANTAstic SEquences and Where to Find Them: Faithful and Efficient API Call Generation through State-Tracked Constrained Decoding and Reranking

Zhuoer Wang, Leonardo F. R. Ribeiro, Alexandros Papangelis, Rohan Mukherjee, Tzu-Yen Wang, Xinyan Zhao, Arijit Biswas, James Caverlee, Angeliki Metallinou

EMNLP 2024 Findings

PDF

Measuring Retrieval Complexity in Question Answering Systems

Matteo Gabburo, Nicolaas Paul Jedema, Siddhant Garg, Leonardo F. R. Ribeiro, Alessandro Moschitti

ACL 2024 Findings

PDF

Eval

On the Role of Summary Content Units in Text Summarization Evaluation

Marcel Nawrath, Agnieszka Wiktoria Nowak, Tristan Ratz, Danilo Constantin Walenta, Juri Opitz, Leonardo F. R. Ribeiro, Joao Sedoc, et al.

NAACL 2024

PDF Code

REFINESUMM: Self-Refining MLLM for Generating a Multimodal Summarization Dataset

Vaidehi Patil, Leonardo F. R. Ribeiro, Mengwen Liu, Mohit Bansal, Markus Dreyer

ACL 2024

PDF

Edit

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 5: Tutorial Abstracts)

Luis Chiruzzo, Hung-yi Lee, Leonardo F. R. Ribeiro, editors

ACL 2024 Tutorial Abstracts

Volume PDF

2023

NLG

Generating Summaries with Controllable Readability Levels

Leonardo F. R. Ribeiro, Mohit Bansal, Markus Dreyer

EMNLP 2023

PDF Code

Robotics

Learning to Reason over Scene Graphs: A Case Study of Finetuning GPT-2 into a Robot Language Model for Grounded Task Planning

Georgia Chalvatzaki, Ali Younes, Daljeet Nandha, An Thai Le, Leonardo F. R. Ribeiro, Iryna Gurevych

Frontiers in Robotics and AI, 2023

PDF Data

2022

Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking

Tim Baumgärtner, Leonardo F. R. Ribeiro, Nils Reimers, Iryna Gurevych

EMNLP 2022

PDF ACL Anthology

UKP-SQuARE v2: Explainability and Adversarial Attacks for Trustworthy QA

Rachneet Sachdeva, Haritz Puerto, Tim Baumgärtner, Sewin Tariverdian, Hao Zhang, Kexin Wang, Hossain Shaikh Saadi, Leonardo F. R. Ribeiro, Iryna Gurevych

AACL 2022 Demo

PDF ACL Anthology

NLG

GEMv2: Multilingual NLG Benchmarking in a Single Line of Code

Sebastian Gehrmann, Abhik Bhattacharjee, Abinaya Mahendiran, Alex Wang, Alexandros Papangelis, et al.

EMNLP 2022 Demo

PDF ACL Anthology

Eval

FactGraph: Evaluating Factuality in Summarization with Semantic Graph Representations

Leonardo F. R. Ribeiro, Mengwen Liu, Iryna Gurevych, Markus Dreyer, Mohit Bansal

NAACL 2022

PDF Code

UKP-SQUARE: An Online Platform for Question Answering Research

Tim Baumgärtner, Kexin Wang, Rachneet Sachdeva, Max Eichler, Gregor Geigle, Clifton Poth, Hannah Sterz, Haritz Puerto, Leonardo F. R. Ribeiro, et al.

ACL 2022 Demo

PDF Code

Thesis

Graph-Based Approaches to Text Generation

Leonardo F. R. Ribeiro

Ph.D. thesis, Technical University of Darmstadt, 2022

2021

Graph

A Neural Graph-Based Local Coherence Model

Mohsen Mesgar, Leonardo F. R. Ribeiro, Iryna Gurevych

EMNLP 2021 Findings

PDF Code

AMR

Smelting Gold and Silver for Improved Multilingual AMR-to-Text Generation

Leonardo F. R. Ribeiro, Jonas Pfeiffer, Yue Zhang, Iryna Gurevych

EMNLP 2021

PDF Code

AMR

Structural Adapters in Pretrained Language Models for AMR-to-Text Generation

Leonardo F. R. Ribeiro, Yue Zhang, Iryna Gurevych

EMNLP 2021

PDF Code Video

NLG

Investigating Pretrained Language Models for Graph-to-Text Generation

Leonardo F. R. Ribeiro, Martin Schmitt, Hinrich Schütze, Iryna Gurevych

NLP4ConvAI, EMNLP 2021

PDF Code Video

NLG

Modeling Graph Structure via Relative Position for Better Text Generation from Knowledge Graphs

Martin Schmitt, Leonardo F. R. Ribeiro, Philipp Dufter, Iryna Gurevych, Hinrich Schütze

TextGraphs, NAACL 2021

PDF Code

2020

NLP

Common Sense or World Knowledge? Investigating Adapter-Based Knowledge Injection into Pretrained Transformers

Anne Lauscher, Olga Majewska, Leonardo F. R. Ribeiro, Iryna Gurevych, Nikolai Rozanov, Goran Glavaš

DeeLIO, EMNLP 2020

PDF

NLG

Metaphoric Paraphrase Generation

Kevin Stowe, Leonardo F. R. Ribeiro, Iryna Gurevych

arXiv, 2020

PDF

NLG

Modeling Global and Local Node Contexts for Text Generation from Knowledge Graphs

Leonardo F. R. Ribeiro, Yue Zhang, Claire Gardent, Iryna Gurevych

TACL 2020

PDF Code Video

2019

AMR

Enhancing AMR-to-Text Generation with Dual Graph Representations

Leonardo F. R. Ribeiro, Claire Gardent, Iryna Gurevych

EMNLP 2019

PDF Code

Law

Análise e Ranqueamento da Rede de Advogados induzida por Processos Judiciais Trabalhistas

Leonardo F. R. Ribeiro, Daniel R. Figueiredo, Pedro R. Nascimento

BraSNAM 2019

PDF

Eval

Ranking Generated Summaries by Correctness: An Interesting but Challenging Application for Natural Language Inference

Tobias Falke, Leonardo F. R. Ribeiro, Prasetya Ajie Utama, Ido Dagan, Iryna Gurevych

ACL 2019

PDF Data

2018 and Earlier

Game

Performance of Monte Carlo Tree Search Algorithms when Playing the Game Ataxx

Leonardo F. R. Ribeiro, Daniel R. Figueiredo

ENIAC 2018

PDF

Graph

struc2vec: Learning Node Representations from Structural Identity

Leonardo F. R. Ribeiro, Pedro H. P. Saverese, Daniel R. Figueiredo

KDD 2017

Project PDF Code

Law

Ranking Lawyers Using a Social Network Induced by Legal Cases

Leonardo F. R. Ribeiro, Daniel R. Figueiredo

Journal of the Brazilian Computer Society, 2017

PDF

Code

FlowTracker: Detecção de Código Não Isócrono via Análise Estática de Fluxo

Bruno R. Silva, Leonardo F. R. Ribeiro, Diego Aranha, Fernando M. Q. Pereira

CBSOFT 2015

PDF