2025-05-07
EchoType is a minimal audio-to-text tool using Spring Boot, React, and OpenAI's Whisper model. Upload a file and get instant transcription, simple, clean, and effective.
Upload audio files in seconds
Transcribe using OpenAI Whisper
Clean and responsive UI
EchoType started as a classic "what if I just..." kind of weekend idea. I had a break between courses, was curious about OpenAI's Whisper model, and thought, why not build something that turns audio into text? It wasn’t meant to be fancy, just a fun little experiment to learn something new. Since API credits don’t grow on trees (and neither does CSN), I knew this would be a one-time build. A quick project, just for me, and now, for my portfolio.
First things first: get Spring Boot talking to OpenAI. It took a few cups of coffee and a mild dependency meltdown, but I eventually got Whisper working locally. After that, I spun up a React app with Vite (because fast builds = happy dev). The goal was simple: one page, upload your file, and boom - transcription. I kept the design super clean to make sure the feature stood out, not the fluff.
Oh, the version drama. I picked Spring AI M2 thinking it would be smooth sailing, but my dependencies clearly had other plans—they wanted to live in the future. That meant cryptic errors, renamed methods, and a bit of confused staring at the screen. Then came the classic “CORS says no” moment when trying to connect frontend to backend. But hey, nothing a custom config can’t fix. In short: a lot of small bumps, but every one was a lesson in disguise.