Yuichi Tateno's picture

In a Training Loop 🔄

Yuichi Tateno PRO

hotchpotch

·

https://secon.dev/

AI & ML interests

Information Retrieval with LLMs

Recent Activity

upvoted an article about 8 hours ago

Introducing Storage Buckets on the Hugging Face Hub

liked a model 7 days ago

Kbenkhaled/Qwen3.5-35B-A3B-NVFP4

liked a model 7 days ago

Sehyo/Qwen3.5-35B-A3B-NVFP4

View all activity

Organizations

upvoted an article about 8 hours ago

Article

Introducing Storage Buckets on the Hugging Face Hub

+10

3 days ago

•

147

upvoted a paper 15 days ago

Diffusion-Pretrained Dense and Contextual Embeddings

Paper • 2602.11151 • Published 29 days ago • 22

upvoted a collection 20 days ago

ColBERT-Zero 🐶

First large-scale fully pre-trained ColBERT model using only public data, outperforming GTE-ModernColBERT and GTE-ModernBERT • 10 items • Updated 9 days ago • 17

upvoted a collection 23 days ago

Bharat-NanoBEIR: Indian Language Retrieval Benchmarks

NanoBEIR retrieval benchmarks translated into 22 Indian languages across 13 datasets. • 22 items • Updated Dec 13, 2025 • 5

upvoted an article about 1 month ago

Article

Transformers.js v4 Preview: Now Available on NPM!

Feb 9

•

76

upvoted a collection about 1 month ago

CoRNStack

State-of-the-art code retrieval and re-ranking models and datasets • 9 items • Updated Mar 26, 2025 • 20

upvoted an article 2 months ago

Article

ModernVBERT: Towards Smaller Visual Document Retrievers

Oct 3, 2025

•

46

upvoted 2 collections 3 months ago

NanoBEIR datasets

These datasets are compatible with the (Sparse)NanoBEIREvaluator with Sentence Transformers v5.2+. Also CrossEncoderNanoBEIREvaluator if bm25 column • 16 items • Updated 10 days ago • 14

Embedding Model Datasets

A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 70 items • Updated Dec 10, 2025 • 164

upvoted an article 3 months ago

Article

Granite 4.0 Nano: Just how small can you go?

Oct 28, 2025

•

123

upvoted 2 articles 4 months ago

Article

Streaming datasets: 100x More Efficient

+3

Oct 27, 2025

•

84

Article

Provence: efficient and robust context pruning for retrieval-augmented generation

Jan 28, 2025

•

25

upvoted 3 articles 5 months ago

Article

huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning

+2

Oct 27, 2025

•

75

Article

Sentence Transformers is joining Hugging Face!

Oct 22, 2025

•

87

Article

Introducing RTEB: A New Standard for Retrieval Evaluation

+4

Oct 1, 2025

•

138

upvoted 2 articles 6 months ago

Article

Nemotron-Personas-Japan: Synthesized Data for Sovereign AI

Sep 23, 2025

•

27

Article

mmBERT: ModernBERT goes Multilingual

+4

Sep 9, 2025

•

135

upvoted 3 articles 8 months ago

Article

Ettin Suite: SoTA Paired Encoders and Decoders

+4

Jul 16, 2025

•

78

Article

Migrating the Hub from Git LFS to Xet

+1

Jul 15, 2025

•

28

Article

Efficient MultiModal Data Pipeline

+3

Jul 8, 2025

•

70