AI at Meta

company

Verified

https://ai.facebook.com/

facebookresearch

Recent Activity

xhan77 authored a paper 5 days ago

LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation

xhan77 authored a paper 5 days ago

MADFormer: Mixed Autoregressive and Diffusion Transformers for Continuous Image Generation

xhan77 authored a paper 5 days ago

TV2TV: A Unified Framework for Interleaved Language and Video Generation

Papers

OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory

Scaling Zero-Shot Reference-to-Video Generation

xhan77

authored 3 papers 5 days ago

LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation

Paper • 2412.15188 • Published Dec 19, 2024 • 1

MADFormer: Mixed Autoregressive and Diffusion Transformers for Continuous Image Generation

Paper • 2506.07999 • Published Jun 9 • 2

TV2TV: A Unified Framework for Interleaved Language and Video Generation

Paper • 2512.05103 • Published 11 days ago • 15

ythu

authored a paper 20 days ago

SAM 3: Segment Anything with Concepts

Paper • 2511.16719 • Published 25 days ago • 112

alcinos

authored a paper 20 days ago

SAM 3: Segment Anything with Concepts

Paper • 2511.16719 • Published 25 days ago • 112

eustlb

authored a paper 21 days ago

Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Recognition Evaluation

Paper • 2510.06961 • Published Oct 8 • 10

pgleize

authored a paper 24 days ago

SAM 3D: 3Dfy Anything in Images

Paper • 2511.16624 • Published 25 days ago • 107

mortimerp9

authored a paper 3 months ago

ARE: Scaling Up Agent Environments and Evaluations

Paper • 2509.17158 • Published Sep 21 • 35

yli1

authored a paper 4 months ago

MetaCLIP 2: A Worldwide Scaling Recipe

Paper • 2507.22062 • Published Jul 29 • 36

JianyuanWang

authored a paper 5 months ago

SpatialTrackerV2: 3D Point Tracking Made Easy

Paper • 2507.12462 • Published Jul 16 • 18

xwen99

authored a paper 5 months ago

Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation

Paper • 2507.08441 • Published Jul 11 • 61

marlamagka

authored 2 papers 6 months ago

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20 • 192

The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements

Paper • 2506.22419 • Published Jun 27 • 15

cointegrated

authored 4 papers 6 months ago

LCFO: Long Context and Long Form Output Dataset and Benchmarking

Paper • 2412.08268 • Published Dec 11, 2024

Large Concept Models: Language Modeling in a Sentence Representation Space

Paper • 2412.08821 • Published Dec 11, 2024 • 17

Exploring Methods for Cross-lingual Text Style Transfer: The Case of Text Detoxification

Paper • 2311.13937 • Published Nov 23, 2023 • 1

BOUQuET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation

Paper • 2502.04314 • Published Feb 6

ortal1602

authored 3 papers 6 months ago

AERO: Audio Super Resolution in the Spectral Domain

Paper • 2211.12232 • Published Nov 22, 2022 • 1

Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation

Paper • 2406.10970 • Published Jun 16, 2024 • 1

HebDB: a Weakly Supervised Dataset for Hebrew Speech Processing

Paper • 2407.07566 • Published Jul 10, 2024