Thomas Ferraz

thomas-ferraz

AI & ML interests

NLP in Portuguese

Recent Activity

new activity 15 days ago

juletxara/mgsm: Convert dataset to Parquet

upvoted a paper 3 days ago

ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation

Paper • 2601.21420 • Published 5 days ago • 40

upvoted a paper 6 days ago

KaVa: Latent Reasoning via Compressed KV-Cache Distillation

Paper • 2510.02312 • Published Oct 2, 2025 • 2

upvoted a collection about 1 month ago

Gemma 3 Release

28 items • Updated Aug 11, 2025 • 605

upvoted an article 4 months ago

mem-agent: Equipping LLM Agents with Memory Using RL

Article • Oct 9, 2025 • 32

upvoted an article 12 months ago

Provence: efficient and robust context pruning for retrieval-augmented generation

Article • Jan 28, 2025 • 25

upvoted 2 papers 12 months ago

ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates

Paper • 2502.06772 • Published Feb 10, 2025 • 21

FastKV: KV Cache Compression for Fast Long-Context Processing with Token-Selective Propagation

Paper • 2502.01068 • Published Feb 3, 2025 • 18

upvoted 2 collections about 1 year ago

mHuBERT-147 models

Compact yet powerful multilingual speech representation models based on the HuBERT architecture. • 3 items • Updated Jun 4, 2024 • 8

EuroLLM

8 items • Updated 19 days ago • 42

upvoted 2 papers over 1 year ago

LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints

Paper • 2410.06458 • Published Oct 9, 2024 • 8

mHuBERT-147: A Compact Multilingual HuBERT Model

Paper • 2406.06371 • Published Jun 10, 2024 • 7

upvoted a collection about 2 years ago

Multilingual DistilWhisper

Multilingual DistilWhisper improves ASR performance in target languages by adding lightweight CLSR modules on top of whisper-small. • 3 items • Updated Mar 18, 2024 • 6

upvoted a paper about 2 years ago

DistilWhisper: Efficient Distillation of Multi-task Speech Models via Language-Specific Experts

Paper • 2311.01070 • Published Nov 2, 2023 • 3

upvoted a paper over 2 years ago

ZeroBERTo: Leveraging Zero-Shot Text Classification by Topic Modeling

Paper • 2201.01337 • Published Jan 4, 2022 • 2