PROJECTS
Systems I've Built
Production-grade AI systems and automation infrastructure, from real-time voice agents to self-hosted workflow engines.
Multilingual AI Calling Agent
Production-grade multilingual AI voice agent with native Bangla support. Handles live telephony calls with streaming speech processing, multi-turn dialogue, and real-time language switching.
Problem
Businesses in multilingual markets need voice agents that understand local languages. English-only solutions fail, and building native language AI requires specialized speech models.
Key Features
Tech Stack
Outcomes
- ✓First production Bangla voice agent
- ✓Multilingual customer experience
- ✓24/7 automated call handling
Social Media Automation Platform
Enterprise-grade automation system for social media management, AI-powered customer engagement, lead generation, and multi-channel notification orchestration.
Problem
Manual social media operations create bottlenecks, missed leads, and inconsistent customer engagement. Teams spend hours on repetitive tasks instead of strategy.
Key Features
Tech Stack
Outcomes
- ✓80% reduction in manual tasks
- ✓Zero-delay lead response
- ✓Unified multi-platform automation
BanglaSTT with BanglaSER
Real-time Bangla speech recognition system with speaker emotion recognition, optimized for telephony environments achieving 90%+ accuracy on live calls.
Problem
Bangla speech recognition lacks telephony-grade accuracy and emotion detection. Existing solutions fail on real-world call audio with noise and compression.
Key Features
Tech Stack
Outcomes
- ✓Production telephony deployment
- ✓Bangla language AI advancement
- ✓Real-time emotion insights
Real-Time Text-to-Speech Reader
Low-latency read-aloud experience that converts selected webpage text into natural, expressive speech with seamless playback and synchronized word highlighting.
Problem
Traditional TTS systems have high latency, unnatural voices, and no visual feedback. Users need real-time, engaging audio experiences.
Key Features
Tech Stack
Outcomes
- ✓Instant audio feedback
- ✓Enhanced reading experience
- ✓Accessibility improvement
Twilight Story Teller
End-to-end AI storytelling platform that transforms written stories into expressive, high-quality spoken narration with custom voice creation and style control.
Problem
Creating audiobook-quality narration requires expensive voice actors and studios. Content creators need accessible, customizable voice generation.
Key Features
Tech Stack
Outcomes
- ✓Professional narration quality
- ✓Custom voice branding
- ✓Scalable content production
Hybrid Multilingual Transcription Studio
GPU-accelerated desktop application for private, offline transcription of meetings and desktop audio with speaker-aware segmentation and multilingual support.
Problem
Cloud transcription tools create privacy, compliance, and reliability risks. Organizations need local-first alternatives that handle noisy, mixed-language conversations.
Key Features
Tech Stack
Outcomes
- ✓Zero cloud dependency
- ✓Reliable meeting transcription
- ✓Production-ready desktop build
Offline Speech Processing System
GPU-accelerated desktop application for offline speech recognition with speaker diarization and neural noise suppression.
Problem
Cloud-based speech processing raises privacy concerns and requires constant connectivity. Organizations need offline solutions.
Key Features
Tech Stack
Outcomes
- ✓Complete data privacy
- ✓No cloud dependency
- ✓Production-ready installer
RosterBhai - Team Roster Management
Complete roster management system with real-time updates, shift swapping, and team scheduling. Built for efficiency and mobile accessibility.
Problem
Manual scheduling is time-consuming, error-prone, and creates communication gaps. Teams need centralized, real-time roster management.
Key Features
Tech Stack
Outcomes
- ✓70% faster scheduling
- ✓Reduced scheduling conflicts
- ✓Improved team coordination
DocSaaS - Smart ePrescription
Modern ePrescription platform with built-in medical database, patient analytics dashboard, and practice insights for healthcare providers.
Problem
Paper prescriptions are error-prone and don't provide practice insights. Doctors need digital tools with medical intelligence.
Key Features
Tech Stack
Outcomes
- ✓Faster prescriptions
- ✓Data-driven practice insights
- ✓Improved patient tracking
Ready to Automate?
Let's build intelligent systems that transform your operations. From AI voice agents to workflow automation, I deliver production-ready solutions.