new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

byAK and the research community

Jul 31

Submitted by

csuhan

ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents

·
7 authors

Submitted by

JingweiZuo

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

·
27 authors

Submitted by

ZarkLngeW

BANG: Dividing 3D Assets via Generative Exploded Dynamics

·
7 authors

3

Submitted by

kenchan0226

VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning

·
12 authors

Submitted by

nielsr

MetaCLIP 2: A Worldwide Scaling Recipe

·
16 authors

Submitted by

eliebak

Step-3 is Large yet Affordable: Model-system Co-design for Cost-effective Decoding

·
199 authors

Submitted by

tulvgengenr

MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE

·
7 authors

Submitted by

xiaofanghf

Adapting Vehicle Detectors for Aerial Imagery to Unseen Domains with Weak Supervision

·
8 authors

Submitted by

HenghuiDing

Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation

·
4 authors

Submitted by

tomhu

Repair-R1: Better Test Before Repair

·
3 authors

Submitted by

akhadangi

Efficient Differentially Private Fine-Tuning of LLMs via Reinforcement Learning

·
5 authors

Submitted by

jahnsonblack

DreamScene: 3D Gaussian-based End-to-end Text-to-3D Scene Generation

·
7 authors