I have a large audiobook collection, in MP3 format, that I would like to transcribe into plain text. I need either a macOS desktop app or an in-browser transcription service (or something else that I'm not aware of).
The challenge: Real-time transcription is NOT an option, because the total duration of all audiobooks is around 125 hours. It is impractical for me to play back 125 hours of audiobooks and sit and listen to it all while transcription occurs. Also, I don't have another machine where I could perform the transcription. I have to do it on my MBP which I use for everyday work/entertainment.
I understand that most, if not all, transcription services/apps are real-time. I don't know much about how transcription works, so I'm wondering if it is even possible to do transcription in an accelerated manner ? i.e. give an app an mp3 file, have it analyze the audio (without needing to play it back real-time) and spit out a text file ? Does transcription need to be real-time in order for the transcription engine to understand individual words ? I'm hoping someone who knows about transcription can provide some info.
If there is a macOS framework/library that can help with this, I'm open to programming my own app to do this.
Help would be appreciated. Thanks.
The challenge: Real-time transcription is NOT an option, because the total duration of all audiobooks is around 125 hours. It is impractical for me to play back 125 hours of audiobooks and sit and listen to it all while transcription occurs. Also, I don't have another machine where I could perform the transcription. I have to do it on my MBP which I use for everyday work/entertainment.
I understand that most, if not all, transcription services/apps are real-time. I don't know much about how transcription works, so I'm wondering if it is even possible to do transcription in an accelerated manner ? i.e. give an app an mp3 file, have it analyze the audio (without needing to play it back real-time) and spit out a text file ? Does transcription need to be real-time in order for the transcription engine to understand individual words ? I'm hoping someone who knows about transcription can provide some info.
If there is a macOS framework/library that can help with this, I'm open to programming my own app to do this.
Help would be appreciated. Thanks.