🩺 Multimodal Medical Product Analysis System

An end-to-end AI-powered multimodal system designed to analyze medical product images using OCR and Large Language Models (LLMs) to generate structured medical insights in both English and Arabic.

This project demonstrates practical skills in Computer Vision, NLP, and AI system design, with a focus on real-world medical use cases.

🚀 Key Features

OCR-based text extraction from medical product images
Text preprocessing and normalization
LLM-powered medical analysis and decision generation
Bilingual output (English & Arabic)
Interactive web interface using Gradio
Modular and scalable pipeline design

🧠 System Pipeline

Image Input
OCR Extraction
Text Preprocessing
Prompt Engineering
LLM-based Analysis
Bilingual Output
Gradio Interface

🛠️ Technologies Used

Python
OpenCV
Tesseract OCR
Hugging Face Transformers
Large Language Models
Gradio
NumPy

▶️ How to Run

pip install -r requirements.txt python app/app.py

⚠️ Medical Disclaimer

This project is for educational and research purposes only and does not replace professional medical advice.

👤 Author

Mina Nabil

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
app		app
assets		assets
notebook		notebook
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🩺 Multimodal Medical Product Analysis System

🚀 Key Features

🧠 System Pipeline

🛠️ Technologies Used

▶️ How to Run

⚠️ Medical Disclaimer

👤 Author

About

Uh oh!

Releases

Packages

Languages

License

the0king0mina/Multimodal-Medical-AI-System

Folders and files

Latest commit

History

Repository files navigation

🩺 Multimodal Medical Product Analysis System

🚀 Key Features

🧠 System Pipeline

🛠️ Technologies Used

▶️ How to Run

⚠️ Medical Disclaimer

👤 Author

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages