Sea AI Lab

company

Verified

https://sail.sea.com

AI & ML interests

None defined yet.

Recent Activity

phyang authored a paper 23 days ago

Visual-ERM: Reward Modeling for Visual Equivalence

slionar submitted a paper 27 days ago

TeamHOI: Learning a Unified Policy for Cooperative Human-Object Interactions with Any Team Size

vermouthdky authored a paper 27 days ago

In-Context Reinforcement Learning for Tool Use in Large Language Models

View all activity

Papers

TeamHOI: Learning a Unified Policy for Cooperative Human-Object Interactions with Any Team Size

Rethinking the Trust Region in LLM Reinforcement Learning

View all Papers

authored a paper 23 days ago

Visual-ERM: Reward Modeling for Visual Equivalence

Paper • 2603.13224 • Published 26 days ago • 21

submitted a paper to Daily Papers 27 days ago

TeamHOI: Learning a Unified Policy for Cooperative Human-Object Interactions with Any Team Size

Paper • 2603.07988 • Published about 1 month ago • 2

authored a paper 27 days ago

In-Context Reinforcement Learning for Tool Use in Large Language Models

Paper • 2603.08068 • Published about 1 month ago • 42

whyu

submitted a paper to Daily Papers about 1 month ago

NoLan: Mitigating Object Hallucinations in Large Vision-Language Models via Dynamic Suppression of Language Priors

Paper • 2602.22144 • Published Feb 25 • 1

submitted a paper to Daily Papers about 2 months ago

When the Prompt Becomes Visual: Vision-Centric Jailbreak Attacks for Large Image Editing Models

Paper • 2602.10179 • Published Feb 10 • 6

authored a paper 2 months ago

Rethinking the Trust Region in LLM Reinforcement Learning

Paper • 2602.04879 • Published Feb 4 • 37

submitted a paper to Daily Papers 2 months ago

Rethinking the Trust Region in LLM Reinforcement Learning

Paper • 2602.04879 • Published Feb 4 • 37

authored a paper 2 months ago

Revisiting Parameter Server in LLM Post-Training

Paper • 2601.19362 • Published Jan 27 • 8

authored 3 papers 2 months ago

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Paper • 2502.12982 • Published Feb 18, 2025 • 19

ZeCO: Zero Communication Overhead Sequence Parallelism for Linear Attention

Paper • 2507.01004 • Published Jul 1, 2025 • 10

Revisiting Parameter Server in LLM Post-Training

Paper • 2601.19362 • Published Jan 27 • 8

submitted a paper to Daily Papers 2 months ago

HalluCitation Matters: Revealing the Impact of Hallucinated References with 300 Hallucinated Papers in ACL Conferences

Paper • 2601.18724 • Published Jan 26 • 7

submitted a paper to Daily Papers 2 months ago

Revisiting Parameter Server in LLM Post-Training

Paper • 2601.19362 • Published Jan 27 • 8

authored a paper 3 months ago

SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving

Paper • 2601.01426 • Published Jan 4 • 24

authored 6 papers 5 months ago

LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction

Paper • 2509.07403 • Published Sep 9, 2025 • 35

Electrocardiogram Instruction Tuning for Report Generation

Paper • 2403.04945 • Published Mar 7, 2024 • 2

Rethinking Kullback-Leibler Divergence in Knowledge Distillation for Large Language Models

Paper • 2404.02657 • Published Apr 3, 2024 • 2

UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers

Paper • 2301.13741 • Published Jan 31, 2023 • 1

D2O: Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models

Paper • 2406.13035 • Published Jun 18, 2024 • 3

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Paper • 2407.13623 • Published Jul 18, 2024 • 56