Blanc Swan's picture

Blanc Swan PRO

blancsw

·

https://swan-blanc.fr/

AI & ML interests

ChatBot

Recent Activity

upvoted a paper 1 day ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

upvoted a paper 1 day ago

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

new activity 2 days ago

Infomaniak-AI/vllm-translategemma-4b-it:Any plan to release a 12B version?

View all activity

Organizations

upvoted 2 papers 1 day ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 28 days ago • 219

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Paper • 2601.22060 • Published 7 days ago • 143

upvoted a collection 2 days ago

TranslateGemma VLLM

Modified version of google/translategemma-4/12/27b-it optimized for deployment with vLLM. • 3 items • Updated 2 days ago • 1

upvoted 2 papers 10 days ago

Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning

Paper • 2601.09088 • Published 22 days ago • 62

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 296

upvoted a collection 19 days ago

TranslateGemma

3 items • Updated 21 days ago • 204

upvoted a collection 24 days ago

Apertus LLM

Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages • 4 items • Updated Oct 1, 2025 • 325

upvoted a paper 24 days ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published 30 days ago • 146

upvoted a paper about 1 month ago

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

Paper • 2512.16676 • Published Dec 18, 2025 • 217

upvoted an article about 2 months ago

Article

The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator

Dec 17, 2025

•

47

upvoted a paper about 2 months ago

Apriel-1.5-15b-Thinker

Paper • 2510.01141 • Published Oct 1, 2025 • 120

upvoted 2 articles about 2 months ago

Article

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

Dec 9, 2025

•

82

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

588

upvoted a paper 2 months ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 152

upvoted a collection 2 months ago

Mistral Large 3

A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated Dec 2, 2025 • 87

upvoted an article 2 months ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

+2

Dec 1, 2025

•

291

upvoted 3 papers 2 months ago

Black-Box On-Policy Distillation of Large Language Models

Paper • 2511.10643 • Published Nov 13, 2025 • 51

WizardCoder: Empowering Code Large Language Models with Evol-Instruct

Paper • 2306.08568 • Published Jun 14, 2023 • 32

Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Paper • 2505.06708 • Published May 10, 2025 • 10

upvoted a collection 2 months ago

Qwen3-Next

4 items • Updated Dec 31, 2025 • 177