10 4

Josiah Aklilu

josaklil-ai

https://josaklil-ai.github.io/

AI & ML interests

computer vision & language for enhancing surgical practice

Recent Activity

upvoted a paper about 1 month ago

PaperSearchQA: Learning to Search and Reason over Scientific Papers with RLVR

upvoted a paper 4 months ago

Search-R3: Unifying Reasoning and Embedding Generation in Large Language Models

liked a model 6 months ago

facebook/Perception-LM-8B

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

PaperSearchQA: Learning to Search and Reason over Scientific Papers with RLVR

Paper • 2601.18207 • Published Jan 26 • 19

upvoted a paper 4 months ago

Search-R3: Unifying Reasoning and Embedding Generation in Large Language Models

Paper • 2510.07048 • Published Oct 8, 2025 • 5

liked a model 6 months ago

facebook/Perception-LM-8B

Image-Text-to-Text • 10B • Updated Jul 14, 2025 • 966 • 65

liked a model 7 months ago

facebook/dinov3-vitb16-pretrain-lvd1689m

Image Feature Extraction • 85.7M • Updated Aug 19, 2025 • 675k • 107

upvoted an article 8 months ago

Article

TimeScope: How Long Can Your Video Large Multimodal Model Go?

Jul 23, 2025

•

liked a dataset 8 months ago

facebook/PE-Video

Viewer • Updated Apr 18, 2025 • 118k • 4.53k • 42

liked a dataset 9 months ago

jmhb/VidDiffBench

Viewer • Updated Mar 16, 2025 • 549 • 253 • 7

upvoted a paper 11 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7, 2025 • 205

upvoted 2 papers 12 months ago

MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research

Paper • 2503.13399 • Published Mar 17, 2025 • 22

Video Action Differencing

Paper • 2503.07860 • Published Mar 10, 2025 • 33

upvoted a collection 12 months ago

Temporal Preference Optimization

Collection

Temporal Preference Optimization for Long-form Video Understanding • 3 items • Updated Jan 19, 2025 • 5

updated a dataset about 1 year ago

josaklil-ai/s1K

Viewer • Updated Feb 13, 2025 • 1.89k • 7

published a dataset about 1 year ago

josaklil-ai/s1K

Viewer • Updated Feb 13, 2025 • 1.89k • 7

updated a dataset about 1 year ago

josaklil-ai/s50K

Viewer • Updated Feb 13, 2025 • 50.4k • 14

published a dataset about 1 year ago

josaklil-ai/s50K

Viewer • Updated Feb 13, 2025 • 50.4k • 14

upvoted a paper about 1 year ago

Temporal Preference Optimization for Long-Form Video Understanding

Paper • 2501.13919 • Published Jan 23, 2025 • 23

authored 2 papers about 1 year ago

BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature

Paper • 2501.07171 • Published Jan 13, 2025 • 55

Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation

Paper • 2501.03225 • Published Jan 6, 2025 • 7

upvoted 2 papers about 1 year ago