Accelerated Preference Optimization for Large Language Model Alignment Paper • 2410.06293 • Published Oct 8, 2024 • 5
MARS: Unleashing the Power of Variance Reduction for Training Large Models Paper • 2411.10438 • Published Nov 15, 2024 • 13
DPLM-2: A Multimodal Diffusion Protein Language Model Paper • 2410.13782 • Published Oct 17, 2024 • 22
An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models Paper • 2408.00724 • Published Aug 1, 2024 • 2
General Preference Modeling with Preference Representations for Aligning Language Models Paper • 2410.02197 • Published Oct 3, 2024 • 9
ProteinBench: A Holistic Evaluation of Protein Foundation Models Paper • 2409.06744 • Published Sep 10, 2024 • 8
Post: We've open-sourced the code and models for Self-Play Preference Optimization (SPPO)! 🚀🚀🚀
🤗 paper: Self-Play Preference Optimization for Language Model Alignment (2405.00675)
⭐ code: https://github.com/uclaml/SPPO
🤗 models: UCLA-AGI/sppo-6635fdd844f2b2e4a94d0b9a
UCLAML/synthetic_data_mistral-7b-instruct-sppo-iter3_score Viewer • Updated Jun 17, 2024 • 20.5k • 55
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision Paper • 2403.09472 • Published Mar 14, 2024 • 1
Self-Play Preference Optimization for Language Model Alignment Paper • 2405.00675 • Published May 1, 2024 • 28