Building on HF


Adina Yakefu

AdinaY

AI & ML interests

None yet

Recent Activity

updated a collection about 2 hours ago

2026 January⛄️ - China Open Source Highlights

liked a dataset about 2 hours ago

MiniMaxAI/OctoCodingBench

Organizations

posted an update about 2 hours ago

Post

After a VLM, StepFun dropped a new audio model: Step-Audio-R1.1, enabling thinking while speaking 🔥

stepfun-ai/Step-Audio-R1.1

✨ Apache 2.0
✨ Combines dual-brain architecture and acoustic-grounded reasoning to enable real-time dialogue with SOTA-level reasoning

posted an update 1 day ago

Post

582

We have a new heatmap live on huggingface now🔥

woojun-jung/open-source-release-heatmap-ko

The Korean community built its own version to track labs that actively publish open work, inspired by the Chinese open source heatmap!

This is the open source community at its best ♥️

posted an update 2 days ago

Post

408

More lightweight multimodal models are coming 👀

StepFun has been focused on multimodal AI from the very beginning. Their latest release is a new foundation model: STEP3-VL🔥
https://huggingface.co/collections/stepfun-ai/step3-vl-10b
✨ 10B - Apache2.0
✨ Leads in the 10B class and competes with models 10–20× larger

posted an update 2 days ago

Post

204

Agentic capability is the new battleground🔥

LongCat-Flash-Thinking-2601, the latest reasoning model from Meituan LongCat

✨ MoE - 560B total / 27B active
✨ MIT license
✨ Agentic tool use
✨ Multi-environment RL
✨ Parallel + iterative reasoning

meituan-longcat/LongCat-Flash-Thinking-2601

posted an update 2 days ago

Post

229

GLM-Image from Z.ai is out 🔥

It was fully trained on Ascend Atlas 800T A2 with MindSpore, probably the first SOTA multimodal model fully trained on domestic chips 👀

zai-org/GLM-Image

✨ Hybrid Architecture: combined autoregressive + diffusion design delivers strong semantic alignment with high-fidelity details
✨ Strong performance in long, dense, and multilingual text rendering
✨ MIT licensed (VQ tokenizer & ViT weights under Apache 2.0)
✨ Now live on Hugging Face inference provider 🤗

posted an update 3 days ago

Post

2541

From ChatGPT Healthcare to Claude for healthcare, AI in medicine is speeding up🚀

Now BaichuanAI joins with Baichuan-M3 🏥 an open medical LLM trained for clinical decision-making

https://huggingface.co/collections/baichuan-inc/baichuan-m3

✨ 235B - Apache2.0
✨ Lower hallucinations via Fact-Aware RL
✨ Built for long medical chats

2 replies

replied to their post 3 days ago

Not sure, they didn't mention it on the model page.

replied to their post 3 days ago

https://huggingface.co/spaces/zh-ai-community/2025-ai-timeline

posted an update 4 days ago

Post

2759

AgentCPM-Explore🔥 an on-device agent foundation model released by OpenBMB
openbmb/AgentCPM-Explore
✨ 4B - Apache2.0
✨ Supports 100+ multi-turn environment interactions with search + verification
✨ Full training/inference stack is openly shared as well

posted an update 4 days ago

Post

2531

Based on the 2025 Chinese AI Timeline, here are some interesting takeaways:

✨ DeepSeek cadence: They shipped almost every month! (except Feb 2025)

✨ Qwen trajectory: Not a single “hit” model, but an expanding product line: VL/Math/Coder/Reranker/Embedding/Omni/Next/Image

✨ Multimodal trend: Steadily rising share, shifting from generation to editing + tooling.

✨ Reasoning as a main track: more engineered, system-level reasoning.

✨ From foundation to components: growth in infra models (embeddings, rerankers, OCR, speech) signals a move toward deployable stacks.

✨ Ecosystem broadening: more players beyond the top labs.

Follow for more updates👉

zh-ai-community

2 replies

posted an update 4 days ago

Post

241

Spirit AI (千寻智能) shared its VLA foundation model Spirit v1.5 on huggingface 🔥

It’s now No. 1 on RoboChallenge’s Table30 leaderboard, beating Pi0.5 🏆
Spirit-AI-robotics/Spirit-v1.5

posted an update 8 days ago

Post

1473

WeChat AI is shipping!

WeDLM 🔥 A new language model that generates tokens in parallel, making it faster than standard LLMs while keeping the same Transformer setup!
https://huggingface.co/collections/tencent/wedlm

✨ 7B/8B - Base & Instruct
✨ Apache 2.0

4 replies

posted an update 8 days ago

Post

2089

Qwen just released two new model series: Qwen3-VL-Embedding & Qwen3-VL-Reranker 🚀

✨ 2B / 8B - Apache2.0
✨ 30+ languages
✨ Supports text, images, screenshots, videos, and arbitrary multimodal combinations

Qwen3-VL-Embedding: Flexible vector sizes (64–2048)
https://huggingface.co/collections/Qwen/qwen3-vl-embedding
Qwen3-VL-Reranker: Built for recall>rerank pipelines
https://huggingface.co/collections/Qwen/qwen3-vl-reranker
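
To make the two ideas above concrete, here is a minimal sketch of a recall-then-rerank flow over shortened vectors. It assumes the flexible vector sizes work by keeping the leading dimensions of a full embedding and re-normalizing, which the post does not state. The toy vectors, the `toy_rerank` scores, and every function name are hypothetical placeholders for illustration, not the Qwen3-VL API.

```python
import math


def truncate_and_normalize(vec, dim):
    """Keep the first `dim` components and rescale to unit length.

    Assumption: shorter embeddings are obtained by truncation.
    """
    head = vec[:dim]
    norm = math.sqrt(sum(x * x for x in head))
    return [x / norm for x in head] if norm > 0 else head


def cosine(a, b):
    """For unit-length vectors the dot product is the cosine similarity."""
    return sum(x * y for x, y in zip(a, b))


def recall_then_rerank(query_vec, doc_vecs, rerank_score, dim=2, top_k=2):
    """Stage 1: cheap similarity search on truncated vectors.
    Stage 2: re-score only the top_k candidates with a costlier scorer."""
    q = truncate_and_normalize(query_vec, dim)
    scored = [
        (cosine(q, truncate_and_normalize(d, dim)), i)
        for i, d in enumerate(doc_vecs)
    ]
    candidates = [i for _, i in sorted(scored, reverse=True)[:top_k]]
    return sorted(candidates, key=lambda i: rerank_score(i), reverse=True)


# Toy data: three 4-dimensional "embeddings" and a stand-in reranker
# that simply looks up a fixed score per document index.
docs = [[1.0, 0.0, 0.2, 0.1], [0.9, 0.1, -0.5, 0.3], [0.0, 1.0, 0.0, 0.0]]
query = [1.0, 0.05, 0.0, 0.0]
toy_rerank = {0: 0.3, 1: 0.8, 2: 0.1}.get

ranking = recall_then_rerank(query, docs, toy_rerank, dim=2, top_k=2)
print(ranking)
```

In this toy run the recall stage shortlists documents 0 and 1, and the rerank stage then puts document 1 first. A real pipeline would swap in model-produced embeddings and a reranker model for the lookup table.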

posted an update 9 days ago

Post

592

MOSS Transcribe Diarize 🔊 A multimodal model for Speaker-Attributed, Time-Stamped Transcription from OpenMOSS.

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization (2601.01554)
OpenMOSS-Team/MOSS-transcribe-diarize

✨ Single-pass end-to-end SATS
✨ 128k context, ~90 min audio
✨ Robust to overlap & noise

1 reply

posted an update 9 days ago

Post

276

Daily Papers just got an AI reading assistant 🔥

You can ask any question you want: clarify a paragraph, get a short summary...all without leaving the page!

✨ Powered by HuggingChat + Hugging Face MCP server

posted an update 11 days ago

Post

1813

Chinese open source AI in December 2025 was about the stack coming together: open, end to end, and ready to ship 🔥

https://huggingface.co/collections/zh-ai-community/december-2025-china-open-source-highlights

✨ Big wave of foundation models: still scaling, but efficiency, reasoning, and deployment now matter more than size
- DeepSeek-V3.2
- Z.ai GLM-4.7
- MiniMax-M2.1
- Xiaomi: MiMo-V2-Flash

✨ Multimodal reasoning is now default
- Z.ai GLM-4.6V
- Z.ai AutoGLM-Phone 9B
- Bytedance: Dolphin-v2

✨ Image & video: editable assets and real workflows
- Qwen-Image-Layered / Image-2512
- Meituan: LongCat-Image & Image Edit
- AIDC: Ovis-Image-7B
- Live-Avatar / LongCat-Video-Avatar
- HY-WorldPlay / RealVideo

✨ Audio goes edge ready
- GLM-ASR-Nano / Fun-ASR-Nano
- GLM-TTS / VoxCPM1.5
- CosyVoice 0.5B

✨ The quiet backbone: data & infrastructure
- Finch (FinWorkBench)
- Tencent ARC: TimeLens-100K
- BIGAI: TongSIM-Asset
- MiniMax: VTP-Base

✨ Also congrats to MiniMax and Z.ai on announcing their IPOs, and to Moonshot on announcing a new $500M funding round 🔥

Like everyone else, I was OOO at the end of December, so feel free to share (in comments or PR) any I missed in this list!

posted an update 11 days ago

Post

1915

MiniMax M2.1 blog is out🔥
https://huggingface.co/blog/MiniMaxAI/multilingual-and-multi-task-coding-with-strong-gen

Only a year into open source, MiniMax is already making a great impact. Not only through solid models/products, but also by how well the team uses community platforms like Hugging Face.

HF Teams, blogs, Daily Papers, Spaces as project pages, and always experimenting with new ways to engage. Super impressive!

posted an update 12 days ago

Post

3621

2025.1 - DeepSeek entered the scene, backed by High Flyer Quant
2026.1 - IQuest enters the game, backed by Uniquant Quant 📈 and launching IQuest-Coder on huggingface
https://huggingface.co/collections/IQuestLab/iquest-coder

✨ 40B models: Instruct / Thinking / Loop
✨ Loop = MoE-level performance with only ~5% extra training cost
✨ Native 128K context

1 reply

posted an update 28 days ago

Post

769

Following up on LLaDA 2.0, the paper is now out on Daily Papers🔥
It has sparked a lot of discussion in the community for showing how discrete diffusion LLMs can scale to 100B and run faster than traditional AR models.
LLaDA2.0: Scaling Up Diffusion Language Models to 100B (2512.15745)

posted an update about 1 month ago

Post

4610

Finch 💰 an enterprise-grade benchmark that measures whether AI agents can truly handle real-world finance & accounting work.

FinWorkBench/Finch

✨ Built from real enterprise data (Enron + financial institutions), not synthetic tasks
✨ Tests end-to-end finance workflows
✨ Multimodal & cross-file reasoning
✨ Expert annotated (700+ hours) and genuinely challenging
