SANGAM AI is an integrated AI platform that provides text-to-text, text-to-image, text-to-speech, text-to-video, and speech-to-speech tools. It combines pre-trained models from Hugging Face and other sources behind a single, streamlined interface for diverse AI-powered tasks.
- Text-to-Text: Generate text-based responses using advanced language models.
- Text-to-Image: Create images from textual descriptions.
- Text-to-Speech: Convert text into realistic voice outputs.
- Text-to-Video: Generate short video content from text prompts.
- Speech-to-Speech: Transform voice inputs into modified voice outputs or translations.
- Streamlit: Web-based user interface for seamless interaction.
- Flask: API and middleware for handling application logic.
- Python: Core language used for integrating AI tools.
- SQLite: Manages chat history and file storage.
- Hugging Face: Pre-trained models for text, speech, and image processing.
- AWS or Azure: Planned deployment targets, with auto-scaling (not yet implemented).
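As a rough illustration of the SQLite role listed above, chat history could be stored along these lines. This is a minimal sketch, not the project's actual schema: the table name `chat_history`, its columns, and the helper functions `open_db`, `save_message`, and `load_history` are all hypothetical.

```python
import sqlite3
from datetime import datetime, timezone

# Hypothetical schema: the table and column names are illustrative assumptions.
SCHEMA = """
CREATE TABLE IF NOT EXISTS chat_history (
    id INTEGER PRIMARY KEY AUTOINCREMENT,
    tool TEXT NOT NULL,
    role TEXT NOT NULL,
    content TEXT NOT NULL,
    created_at TEXT NOT NULL
)
"""


def open_db(path=":memory:"):
    """Open (or create) the database and make sure the table exists."""
    conn = sqlite3.connect(path)
    conn.execute(SCHEMA)
    return conn


def save_message(conn, tool, role, content):
    """Store one message and return its row id."""
    created_at = datetime.now(timezone.utc).isoformat()
    with conn:  # commits on success, rolls back on error
        cur = conn.execute(
            "INSERT INTO chat_history (tool, role, content, created_at) "
            "VALUES (?, ?, ?, ?)",
            (tool, role, content, created_at),
        )
    return cur.lastrowid


def load_history(conn, tool):
    """Return (role, content) pairs for one tool, oldest first."""
    rows = conn.execute(
        "SELECT role, content FROM chat_history WHERE tool = ? ORDER BY id",
        (tool,),
    )
    return rows.fetchall()
```

Passing a file path such as `database/chat.db` (an assumed name) to `open_db` would persist the history between runs; the default `:memory:` database is only useful for experimentation.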
- frontend/: Contains Streamlit UI files.
- backend/: Flask-based backend logic.
- models/: Pre-trained models for inference.
- database/: SQLite schema and chat history.
- scripts/: Utilities and helpers for handling prompts and file parsing.
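A quick way to confirm that a checkout matches the layout above is a small check like the following. The directory names come from the list; the helper `missing_dirs` is a hypothetical utility, not part of the repository.

```python
from pathlib import Path

# Directory names taken from the project structure listed above.
EXPECTED_DIRS = ("frontend", "backend", "models", "database", "scripts")


def missing_dirs(root):
    """Return the expected top-level directories that are absent under root."""
    root = Path(root)
    return [name for name in EXPECTED_DIRS if not (root / name).is_dir()]
```

Calling `missing_dirs(".")` from the project root should return an empty list when every directory is present.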
- Clone the repository:
    git clone https://github.com/your-username/sangam-ai.git
- Navigate to the project directory:
    cd sangam-ai
- Set up virtual environments for each AI tool:
    python -m venv venv_text
    python -m venv venv_image
    # Repeat for all tools.
- Activate the environment and install dependencies:
    source venv_text/bin/activate
    pip install -r requirements.txt
- Run the application:
    streamlit run frontend/app.py
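The "repeat for all tools" step can also be scripted with Python's standard `venv` module. The sketch below assumes the `venv_<tool>` naming seen in `venv_text` and `venv_image`; the tool list and the helpers `env_dir` and `create_envs` are illustrative, and names for the remaining tools would need to be added by hand.

```python
import venv
from pathlib import Path

# "text" and "image" mirror venv_text and venv_image from the steps above.
# Any further tool names are assumptions to be filled in as needed.
TOOL_ENVS = ("text", "image")


def env_dir(root, tool):
    """Return the environment path for a tool, e.g. <root>/venv_text."""
    return Path(root) / f"venv_{tool}"


def create_envs(root, tools=TOOL_ENVS, with_pip=True):
    """Create one virtual environment per tool, skipping any that already exist."""
    envs = []
    for tool in tools:
        target = env_dir(root, tool)
        if not (target / "pyvenv.cfg").exists():
            venv.create(target, with_pip=with_pip)
        envs.append(target)
    return envs
```

Dependencies still need to be installed inside each environment afterwards, as in the activation step above.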
- Open the app in your browser.
- Select the desired tool from the dashboard.
- Provide the input (text, speech, or files).
- Get results and download or save them as required.
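Behind the dashboard, selecting a tool and passing it input amounts to a lookup followed by a call. The sketch below shows one way that routing might look; it is not the project's implementation. The handlers are placeholders that only echo their input, and `make_placeholder`, `HANDLERS`, and `run_tool` are hypothetical names.

```python
# Tool names taken from the feature list; the handlers are stand-ins.
TOOL_NAMES = (
    "text-to-text",
    "text-to-image",
    "text-to-speech",
    "text-to-video",
    "speech-to-speech",
)


def make_placeholder(tool_name):
    """Build a stand-in handler; a real one would call a pre-trained model."""
    def handler(user_input):
        return f"[{tool_name}] would process: {user_input}"
    return handler


HANDLERS = {name: make_placeholder(name) for name in TOOL_NAMES}


def run_tool(name, user_input):
    """Route the input to the selected tool, rejecting unknown tool names."""
    try:
        handler = HANDLERS[name]
    except KeyError:
        raise ValueError(f"Unknown tool: {name!r}") from None
    return handler(user_input)
```

Keeping the mapping in one dictionary means a new tool can be registered without touching the routing logic.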
This project is licensed under the MIT License.
MIT License
Copyright (c) 2024 Gautam
Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.
- Fork the repository.
- Create a feature branch:
    git checkout -b feature-name
- Commit your changes:
    git commit -m "Add new feature"
- Push the branch:
    git push origin feature-name
- Open a pull request.
For any queries or suggestions, feel free to reach out:
- Email: galvanmoto@gmail.com
- GitHub: galvanmoto
- Hugging Face for providing pre-trained AI models.
- Streamlit and Flask for the frameworks behind the interactive interface and the API.
- Open-source contributors for valuable libraries and tools.