Spaces:

amogneandualem
/

amogne-vlm-LLM

Running

App Files Files Community

amogneandualem commited on 5 days ago

Commit

9577b99

verified ·

1 Parent(s): ee53cfb

Update README.md

Browse files

Files changed (1) hide show

README.md +70 -16

README.md CHANGED Viewed

@@ -9,24 +9,78 @@ app_file: app.py
 pinned: false
 ---
-# 🌍 Multilingual Image Describer
-*Powered by BLIP + YOLOv8 | UCAS @2025*
-This application analyzes images to detect objects and generate descriptions in multiple languages.
-## Features
-- Upload images or use camera
-- Object detection with YOLOv8
-- Image captioning with BLIP
-- Support for multiple languages
-- Export results as JSON/TXT
-## How to Use
-1. Upload an image or use the camera
-2. Select your preferred language
-3. Click "Analyze" to process the image
-4. View results and export if needed
----
-[View configuration reference](https://huggingface.co/docs/hub/spaces-config-reference)

 pinned: false
 ---
+# 🌍 Multilingual Image Describer with Real Translation
+A powerful web application that analyzes images to detect objects and generate descriptions in **14 different languages** with real translation capabilities.
+## ✨ Features
+- **Real-time Translation**: Uses Hugging Face NLLB model for accurate translations
+- **Object Detection**: Powered by YOLOv8 with confidence scores
+- **Image Captioning**: Generates natural language descriptions using BLIP
+- **14 Languages**: English, Spanish, French, German, Chinese, Hindi, Arabic, Russian, Japanese, Korean, Portuguese, Italian, Amharic, Turkish
+- **Export Results**: Download analysis as JSON or TXT files
+## 🚀 How to Use
+### 1. Upload Image
+- Click "Upload" to select an image file
+- Use "Camera" to capture a photo
+- Try "Sample" for a demo image
+### 2. Configure Settings
+- **Select Language**: Choose from 14 available languages
+- **API Token**: Optional Hugging Face token for better translation
+- **Confidence**: Adjust object detection sensitivity
+### 3. Analyze
+- Click "🚀 Analyze Image" to process
+- View results in Description, Objects, and Export tabs
+- Download results for later use
+## 🔧 Translation Setup
+For best translation results:
+1. **Get a free Hugging Face token**:
+   - Visit [huggingface.co/settings/tokens](https://huggingface.co/settings/tokens)
+   - Create a new token (select "Read" permission)
+   - Copy the token (starts with `hf_`)
+2. **Enter token in the app**:
+   - Paste token in the "Hugging Face Token" field in sidebar
+   - Token is stored locally for your session
+## 🌐 Supported Languages
+| Language | Code | Support |
+|----------|------|---------|
+| English | en | ✅ Full |
+| Spanish | es | ✅ Full |
+| French | fr | ✅ Full |
+| German | de | ✅ Full |
+| Chinese | zh | ✅ Full |
+| Hindi | hi | ✅ Full |
+| Arabic | ar | ✅ Full |
+| Russian | ru | ✅ Full |
+| Japanese | ja | ✅ Full |
+| Korean | ko | ✅ Full |
+| Portuguese | pt | ✅ Full |
+| Italian | it | ✅ Full |
+| Amharic | am | ✅ Full |
+| Turkish | tr | ✅ Full |
+## 📊 Technical Details
+- **Image Models**: BLIP (captioning) + YOLOv8 (detection)
+- **Translation**: Facebook NLLB-200-distilled-600M via Hugging Face Inference API
+- **Framework**: Streamlit + Python 3.10
+- **Deployment**: Hugging Face Spaces
+## 🛠️ Development
+### Local Setup
+```bash
+git clone https://huggingface.co/spaces/amogneandualem/amogne-vlm-LLM
+cd amogne-vlm-LLM
+pip install -r requirements.txt
+streamlit run app.py