Publications and Preprints

2025

Preprint

d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning

Siyan Zhao*, Devaansh Gupta*, Qinqing Zheng, and Aditya Grover

preprint, 2025

arXiv Code Website
ICLR 2025

Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs

Siyan Zhao, Mingyi Hong, Yang Liu, Devamanyu Hazarika, and Kaixiang Lin

ICLR, 2025

Oral Presentation, 1.8% acceptance rate

arXiv Code Website
AISTATS 2025

Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models

Siyan Zhao*, Daniel Israel*, Guy Van den Broeck, and Aditya Grover

AISTATS, 2025

arXiv Code

2024

Preprint

MedMax: Mixed-Modal Instruction Tuning for Training Biomedical Assistants

Hritik Bansal, Daniel Israel*, Siyan Zhao*, Shufan Li, Tung Nguyen, and Aditya Grover

preprint, 2024

arXiv Code
NeurIPS 2024

Probing the Decision Boundaries of In-context Learning in Large Language Models

Siyan Zhao, Tung Nguyen, and Aditya Grover

NeurIPS, 2024

Best Paper Runner-Up at the NeurIPS 2024 MINT Workshop

arXiv Code
ICLR 2024

Group Preference Optimization: Few-Shot Alignment of Large Language Models

Siyan Zhao, John Dang, and Aditya Grover

ICLR, 2024

arXiv Code Website

2023

NeurIPS 2023

Decision Stacks: Flexible Reinforcment Learning Via Modular Generative Models

Siyan Zhao, and Aditya Grover

NeurIPS, 2023

arXiv Code Website

2022

ICRA 2022

Object Insertion Based Data Augmentation for Semantic Segmentation

Yuan Ren, Siyan Zhao, and Bingbing Liu

ICRA, 2022

PDF

2020

Preprint

One Demonstration Imitation Learning

Bradly C. Stadie*, Siyan Zhao*, Qiqi Xu, Bonnie Li, and Lunjun Zhang

preprint, 2020

PDF