Suresh Veeragoni's picture

Suresh Veeragoni

veeragoni

·

AI & ML interests

None yet

Organizations

None yet

upvoted an article 6 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

Jul 8, 2025

•

741

upvoted 2 collections 10 months ago

Open-RS

Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t" • 8 items • Updated Mar 21, 2025 • 12

DiffRhythm

5 items • Updated May 8, 2025 • 15

upvoted 7 collections over 1 year ago

LLM Reasoning Papers

Papers to improve reasoning capabilities of LLMs • 20 items • Updated Jan 15, 2025 • 123

⛈️ Llama-3.1 Storm Models

Fine-tuned Llama 3.1 8B model with superior reasoning, conversation abilities, and function calling! • 3 items • Updated Aug 25, 2024 • 15

Llama 3.1

12 items • Updated Jul 23, 2024 • 13

Tulu V2.5 Suite

A suite of models trained using DPO and PPO across a wide variety (up to 14) of preference datasets. See https://arxiv.org/abs/2406.09279 for more! • 44 items • Updated 11 days ago • 15

4M Models

Multimodal models from https://4m.epfl.ch/ • 17 items • Updated Mar 7, 2025 • 31

Magpie-Pro Datasets (Llama-3)

Dataset built with Meta Llama 3 70B. • 6 items • Updated Jan 13, 2025 • 16

PaliGemma Release

Pretrained and mix checkpoints for PaliGemma • 16 items • Updated Jul 10, 2025 • 149

upvoted a collection almost 2 years ago

Comparing DPO with IPO and KTO

A collection of chat models to explore the differences between three alignment techniques: DPO, IPO, and KTO. • 56 items • Updated Jan 8, 2025 • 32

upvoted a paper about 2 years ago

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 260

upvoted 2 collections about 2 years ago

Awesome feedback datasets

A curated list of datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO. • 19 items • Updated Apr 12, 2024 • 69

Awesome SFT datasets

A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12, 2024 • 147