Open-source voice data collection platform for building inclusive voice datasets. Collaborative transcription with quality consensus. FastAPI + React + PostgreSQL.

Onuronon-lab/Shrutik

Shrutik (শ্রুতিক) - Voice Data Collection Platform

Empowering Communities Through Voice Technology

License: CC BY-NC-SA 4.0 | Python 3.11+ | FastAPI | React

Vision

Shrutik (শ্রুতিক) is an open-source crowdsourcing platform designed to democratize voice technology for underrepresented languages. Named after the Bengali word meaning "listener" or "one who hears," Shrutik bridges the digital divide by enabling communities to contribute their voices and build inclusive AI systems.

Why Shrutik?

Voice technology is advancing rapidly, yet speakers of many regional and minority languages are left behind. Major voice assistants and speech recognition systems work poorly, or not at all, for languages such as Bengali, Tamil, Urdu, and hundreds of others that together are spoken by billions of people.

Shrutik changes this narrative by:

  • Empowering Communities: Enabling native speakers to contribute voice data in their own languages
  • Preserving Linguistic Diversity: Capturing the rich variations, dialects, and accents within languages
  • Democratizing AI: Making voice technology accessible to underrepresented communities
  • Open Source: Ensuring the tools and data remain community-owned and accessible
  • Educational: Teaching communities about AI while they contribute to its development

📚 Documentation

📖 Complete Documentation - Comprehensive guides, API reference, and tutorials

💡 Tip: The documentation includes search functionality, mobile-friendly design, and copy-paste code examples!

✨ What Makes Shrutik Special

For Contributors

  • Intuitive Interface: Easy-to-use web platform for recording and transcribing voice data
  • Quality Assurance: Built-in consensus mechanisms ensure high-quality datasets
  • Gamification: Engaging experience with progress tracking and community leaderboards
  • Multilingual Support: Native support for multiple languages and scripts
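
The consensus idea behind the Quality Assurance point can be illustrated with a small majority-vote sketch. This README does not describe Shrutik's actual consensus algorithm, so everything here is an assumption for illustration: the function names `normalize` and `consensus`, and the `min_votes` and `min_agreement` thresholds, are hypothetical.

```python
import unicodedata
from collections import Counter


def normalize(text: str) -> str:
    """Normalize a transcription so trivially different submissions compare equal."""
    # NFC keeps composed characters consistent, which matters for scripts such as Bengali.
    text = unicodedata.normalize("NFC", text)
    # Collapse runs of whitespace and ignore case differences.
    return " ".join(text.split()).casefold()


def consensus(transcriptions, min_votes=2, min_agreement=0.6):
    """Return (text, agreement) when one transcription wins clearly, else (None, agreement).

    Hypothetical rule: the most common normalized text must have at least
    `min_votes` votes and at least `min_agreement` of all submissions.
    """
    if not transcriptions:
        return None, 0.0
    counts = Counter(normalize(t) for t in transcriptions)
    winner, votes = counts.most_common(1)[0]
    agreement = votes / len(transcriptions)
    if votes >= min_votes and agreement >= min_agreement:
        return winner, agreement
    return None, agreement


if __name__ == "__main__":
    submissions = ["আমি ভাত খাই", "আমি  ভাত খাই ", "আমি ভাত খাব"]
    print(consensus(submissions))
```

Here two of the three submissions differ only in whitespace, so they count as the same vote. A chunk with no clear winner returns `None` and could be routed to more contributors.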

For Researchers & Developers

  • Production-Ready: Scalable architecture built with FastAPI and React
  • Comprehensive API: RESTful APIs for integration with external systems
  • Advanced Analytics: Real-time monitoring and quality metrics
  • Export Capabilities: Multiple format support for dataset export
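
As a rough illustration of multi-format export, the sketch below writes the same records as JSON Lines and as CSV using only the Python standard library. The formats Shrutik actually supports are not listed in this README, and the `Clip` record with its fields is a hypothetical schema, not the project's data model.

```python
import csv
import io
import json
from dataclasses import asdict, dataclass, fields


@dataclass
class Clip:
    # Hypothetical fields, chosen for illustration only.
    clip_id: str
    language: str
    duration_s: float
    transcription: str


def to_jsonl(clips):
    """Serialize clips as JSON Lines, one JSON object per line."""
    # ensure_ascii=False keeps non-Latin scripts readable in the output.
    return "".join(json.dumps(asdict(c), ensure_ascii=False) + "\n" for c in clips)


def to_csv(clips):
    """Serialize clips as CSV with a header row."""
    buf = io.StringIO()
    names = [f.name for f in fields(Clip)]
    writer = csv.DictWriter(buf, fieldnames=names, lineterminator="\n")
    writer.writeheader()
    for c in clips:
        writer.writerow(asdict(c))
    return buf.getvalue()


if __name__ == "__main__":
    clips = [Clip("c1", "bn", 2.5, "আমি ভাত খাই")]
    print(to_jsonl(clips), end="")
    print(to_csv(clips), end="")
```

Keeping the record definition in one place means both exporters stay in sync when a field is added.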

For Organizations

  • Self-Hosted: Complete control over your data and infrastructure
  • Customizable: Modular design allows easy customization for specific needs
  • Enterprise Features: Role-based access, audit trails, and performance monitoring
  • Docker Support: Easy deployment with containerization

Architecture

Shrutik is built with modern, scalable technologies:

  • Backend: FastAPI (Python) with PostgreSQL and Redis
  • Frontend: React with TypeScript and Tailwind CSS
  • Audio Processing: Librosa and PyDub for intelligent audio chunking
  • Background Jobs: Celery with Redis for async processing
  • Monitoring: Built-in performance monitoring and health checks
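
To give a feel for what audio chunking involves, here is a minimal pure-Python sketch that splits a recording wherever a long enough silence occurs. It is a simplified stand-in and does not use Librosa or PyDub; this README does not describe Shrutik's actual chunking logic. The function name `find_voiced_chunks` and the `silence_thresh` and `min_silence_ms` parameters are assumptions made for the example.

```python
def find_voiced_chunks(samples, sample_rate, silence_thresh=0.02, min_silence_ms=300):
    """Return (start, end) sample-index pairs for voiced regions.

    A chunk is closed once at least `min_silence_ms` of consecutive samples
    stay below `silence_thresh` in absolute amplitude. `end` is exclusive.
    """
    min_silence = sample_rate * min_silence_ms // 1000  # silence length in samples
    chunks = []
    start = None        # index where the current chunk began
    last_voiced = None  # index of the most recent sample above the threshold
    for i, s in enumerate(samples):
        if abs(s) >= silence_thresh:
            if start is None:
                start = i
            last_voiced = i
        elif start is not None and i - last_voiced >= min_silence:
            # Silence has lasted long enough: close the chunk at the last voiced sample.
            chunks.append((start, last_voiced + 1))
            start = None
    if start is not None:
        chunks.append((start, last_voiced + 1))
    return chunks


if __name__ == "__main__":
    # Toy signal at 10 samples per second: speech, a short pause, speech, a long pause, speech.
    signal = [0.0, 0.5, 0.6, 0.0, 0.4, 0.0, 0.0, 0.0, 0.0, 0.7, 0.0]
    print(find_voiced_chunks(signal, sample_rate=10))
```

Short pauses inside a phrase stay within one chunk, while longer pauses start a new one. In a deployment like the one described above, the resulting index pairs would typically be handed to a background worker that cuts and stores each chunk.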

Contributing

We welcome contributions from developers, linguists, designers, and community members! Whether you're fixing bugs, adding features, improving documentation, or contributing voice data, every contribution matters.

Ways to Contribute:

  • Voice Data: Record and transcribe in your native language
  • Code: Backend, frontend, or infrastructure improvements
  • Documentation: Help improve our guides and tutorials
  • Localization: Translate the interface to new languages
  • Testing: Report bugs and help with quality assurance

See our Contributing Guide for detailed instructions.

Community

Join our growing community of contributors:

  • Discord: Join our server for real-time discussions
  • GitHub Discussions: Share ideas and ask questions
  • Twitter: Follow @OnurononAI for updates

📄 License

Shrutik is released under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA 4.0). Under this license you can:

  • Use for learning and education - Study and understand the code
  • Modify and adapt - Create your own versions and improvements
  • Share for non-commercial purposes - Use in hobby projects and open source

These permissions come with the following conditions:

  • Attribution - Give credit to the original creator
  • NonCommercial - Commercial use requires separate permission
  • ShareAlike - If you modify and distribute this work, you must release it under the same license

Acknowledgments

Shrutik stands on the shoulders of giants. We thank the open-source community and the researchers who have made voice technology accessible, and we give special recognition to the communities worldwide who contribute their voices to make AI more inclusive.


Together, we're building a more inclusive digital future, one voice at a time.
