amogneandualem commited on
Commit
9577b99
Β·
verified Β·
1 Parent(s): ee53cfb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +70 -16
README.md CHANGED
@@ -9,24 +9,78 @@ app_file: app.py
9
  pinned: false
10
  ---
11
 
12
- # 🌍 Multilingual Image Describer
13
- *Powered by BLIP + YOLOv8 | UCAS @2025*
14
 
15
- This application analyzes images to detect objects and generate descriptions in multiple languages.
16
 
17
- ## Features
18
- - Upload images or use camera
19
- - Object detection with YOLOv8
20
- - Image captioning with BLIP
21
- - Support for multiple languages
22
- - Export results as JSON/TXT
23
 
24
- ## How to Use
25
- 1. Upload an image or use the camera
26
- 2. Select your preferred language
27
- 3. Click "Analyze" to process the image
28
- 4. View results and export if needed
29
 
30
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
31
 
32
- [View configuration reference](https://huggingface.co/docs/hub/spaces-config-reference)
 
 
 
 
 
 
9
  pinned: false
10
  ---
11
 
12
+ # 🌍 Multilingual Image Describer with Real Translation
 
13
 
14
+ A powerful web application that analyzes images to detect objects and generate descriptions in **14 different languages** with real translation capabilities.
15
 
16
+ ## ✨ Features
17
+ - **Real-time Translation**: Uses Hugging Face NLLB model for accurate translations
18
+ - **Object Detection**: Powered by YOLOv8 with confidence scores
19
+ - **Image Captioning**: Generates natural language descriptions using BLIP
20
+ - **14 Languages**: English, Spanish, French, German, Chinese, Hindi, Arabic, Russian, Japanese, Korean, Portuguese, Italian, Amharic, Turkish
21
+ - **Export Results**: Download analysis as JSON or TXT files
22
 
23
+ ## πŸš€ How to Use
 
 
 
 
24
 
25
+ ### 1. Upload Image
26
+ - Click "Upload" to select an image file
27
+ - Use "Camera" to capture a photo
28
+ - Try "Sample" for a demo image
29
+
30
+ ### 2. Configure Settings
31
+ - **Select Language**: Choose from 14 available languages
32
+ - **API Token**: Optional Hugging Face token for better translation
33
+ - **Confidence**: Adjust object detection sensitivity
34
+
35
+ ### 3. Analyze
36
+ - Click "πŸš€ Analyze Image" to process
37
+ - View results in Description, Objects, and Export tabs
38
+ - Download results for later use
39
+
40
+ ## πŸ”§ Translation Setup
41
+
42
+ For best translation results:
43
+
44
+ 1. **Get a free Hugging Face token**:
45
+ - Visit [huggingface.co/settings/tokens](https://huggingface.co/settings/tokens)
46
+ - Create a new token (select "Read" permission)
47
+ - Copy the token (starts with `hf_`)
48
+
49
+ 2. **Enter token in the app**:
50
+ - Paste token in the "Hugging Face Token" field in sidebar
51
+ - Token is stored locally for your session
52
+
53
+ ## 🌐 Supported Languages
54
+
55
+ | Language | Code | Support |
56
+ |----------|------|---------|
57
+ | English | en | βœ… Full |
58
+ | Spanish | es | βœ… Full |
59
+ | French | fr | βœ… Full |
60
+ | German | de | βœ… Full |
61
+ | Chinese | zh | βœ… Full |
62
+ | Hindi | hi | βœ… Full |
63
+ | Arabic | ar | βœ… Full |
64
+ | Russian | ru | βœ… Full |
65
+ | Japanese | ja | βœ… Full |
66
+ | Korean | ko | βœ… Full |
67
+ | Portuguese | pt | βœ… Full |
68
+ | Italian | it | βœ… Full |
69
+ | Amharic | am | βœ… Full |
70
+ | Turkish | tr | βœ… Full |
71
+
72
+ ## πŸ“Š Technical Details
73
+
74
+ - **Image Models**: BLIP (captioning) + YOLOv8 (detection)
75
+ - **Translation**: Facebook NLLB-200-distilled-600M via Hugging Face Inference API
76
+ - **Framework**: Streamlit + Python 3.10
77
+ - **Deployment**: Hugging Face Spaces
78
+
79
+ ## πŸ› οΈ Development
80
 
81
+ ### Local Setup
82
+ ```bash
83
+ git clone https://huggingface.co/spaces/amogneandualem/amogne-vlm-LLM
84
+ cd amogne-vlm-LLM
85
+ pip install -r requirements.txt
86
+ streamlit run app.py