Shrutik (শ্রুতিক) is an open-source crowdsourcing platform designed to democratize voice technology for underrepresented languages. Named after the Bengali word meaning "listener" or "one who hears," Shrutik bridges the digital divide by enabling communities to contribute their voices and build inclusive AI systems.
In a world where voice technology is rapidly advancing, millions of speakers of regional and minority languages are left behind. Major voice assistants and speech recognition systems work poorly or not at all for languages like Bengali, Tamil, Urdu and hundreds of others spoken by billions of people.
Shrutik changes this narrative by:
- Empowering Communities: Enabling native speakers to contribute voice data in their own languages
- Preserving Linguistic Diversity: Capturing the rich variations, dialects, and accents within languages
- Democratizing AI: Making voice technology accessible to underrepresented communities
- Open Source: Ensuring the tools and data remain community-owned and accessible
- Educational: Teaching communities about AI while they contribute to its development
📖 Complete Documentation - Comprehensive guides, API reference, and tutorials
- Getting Started - Set up Shrutik in 5 minutes
- Docker Setup - Run with Docker (recommended)
- Local Development - Native development setup
- Architecture - System design and components
- API Reference - Complete API documentation
- Contributing - How to contribute
- FAQ - Frequently asked questions
- Troubleshooting - Common issues and solutions
💡 Tip: The documentation includes search functionality, mobile-friendly design, and copy-paste code examples!
- Intuitive Interface: Easy-to-use web platform for recording and transcribing voice data
- Quality Assurance: Built-in consensus mechanisms ensure high-quality datasets
- Gamification: Engaging experience with progress tracking and community leaderboards
- Multilingual Support: Native support for multiple languages and scripts
- Production-Ready: Scalable architecture built with FastAPI and React
- Comprehensive API: RESTful APIs for integration with external systems
- Advanced Analytics: Real-time monitoring and quality metrics
- Export Capabilities: Multiple format support for dataset export
- Self-Hosted: Complete control over your data and infrastructure
- Customizable: Modular design allows easy customization for specific needs
- Enterprise Features: Role-based access, audit trails, and performance monitoring
- Docker Support: Easy deployment with containerization
Shrutik is built with modern, scalable technologies:
- Backend: FastAPI (Python) with PostgreSQL and Redis
- Frontend: React with TypeScript and Tailwind CSS
- Audio Processing: Librosa and PyDub for intelligent audio chunking
- Background Jobs: Celery with Redis for async processing
- Monitoring: Built-in performance monitoring and health checks
We welcome contributions from developers, linguists, designers, and community members! Whether you're fixing bugs, adding features, improving documentation, or contributing voice data, every contribution matters.
Ways to Contribute:
- Voice Data: Record and transcribe in your native language
- Code: Backend, frontend, or infrastructure improvements
- Documentation: Help improve our guides and tutorials
- Localization: Translate the interface to new languages
- Testing: Report bugs and help with quality assurance
See our Contributing Guide for detailed instructions.
Join our growing community of contributors:
- Discord: Join our server for real-time discussions
- GitHub Discussions: Share ideas and ask questions
- Twitter: Follow @OnurononAI for updates
Shrutik is released under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. This means you can:
- ✅ Use for learning and education - Study and understand the code
- ✅ Modify and adapt - Create your own versions and improvements
- ✅ Share for non-commercial purposes - Use in hobby projects and open source
- ✅ Attribution required - Give credit to the original creator
- ❌ No commercial use - Commercial use requires separate permission
If you modify and distribute this work, you must use the same license and provide attribution.
Shrutik is built on the shoulders of giants. We thank the open-source community and the researchers who have made voice technology accessible. Special recognition to communities worldwide who contribute their voices to make AI more inclusive.
Together, we're building a more inclusive digital future, one voice at a time.
