Ingrid Tveten
ingridtv
AI & ML interests
Medical image analysis and machine learning
Recent Activity
Updated a collection 28 days ago: Document understanding
Updated a collection about 1 month ago: Document understanding
Updated a collection about 1 month ago: GenAI/LLM
Organizations
None yet
Medical images, encoding
Multimodal/VLM
- microsoft/Phi-4-multimodal-instruct
  Automatic Speech Recognition • 6B • Updated • 257k • 1.55k
- microsoft/Phi-4-mini-instruct
  Text Generation • 4B • Updated • 144k • 649
- SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
  Paper • 2503.11576 • Published • 123
- Emerging Properties in Unified Multimodal Pretraining
  Paper • 2505.14683 • Published • 133
Medical LM, Specific
GenAI/LLM
- deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
  Text Generation • 2B • Updated • 2.59M • 1.42k
- Qwen/CodeQwen1.5-7B-Chat
  Text Generation • 7B • Updated • 1.37k • 350
- lmstudio-community/gemma-3-12b-it-GGUF
  Image-Text-to-Text • 12B • Updated • 40.3k • 38
- google/gemma-3-12b-it-qat-q4_0-gguf
  Image-Text-to-Text • 12B • Updated • 79.5k • 226
Document understanding