This is just a little utility for automatically transcribing the contents of a personal audio recorder. I use one for managing the noise in my ADHD-addled head, and the Whisper speech-to-text engine makes the transcription very easy.
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Elf M. Sternberg e95ec93509 Moved to PDM 7 months ago
.gitignore Moved to PDM 7 months ago
.pdm.toml Moved to PDM 7 months ago A little too to assist with TCD transcription. 8 months ago Revised styling 8 months ago
pdm.lock Moved to PDM 7 months ago
pyproject.toml Moved to PDM 7 months ago Wrap transcription attempt in a try/except 7 months ago

Language: Python Topic: Utility

TCDump: Thought Capture Device Transcriber

My current TCD
I have ADHD and I use a small tape recorder to capture all of the ephemeral ideas and incidents that happen around me. The OpenAI project recently released [Whisper](, a speech-to-text engine that has surprisingly good recognition even with it's smallest model.

This script is extremely personal and specific: it effectively transcribes and dumps the contents of my tape recorder as text to stdout. That's out. It's extremely useful to me, especially in conjunction with the Unix tee program.

If you have a tape recorder that presents as a USB device, you could probably just make this work for you by:

  • editing the path to the repository of recordings
  • the regular expression describing the file name convention
  • the strptime format in converttcdfilename that turns the filename into a human-readable date.

Note that "Could not find the repo" is an error condition (exit status 1), but "No files found in repo" or "No files found meeting your criteria" or "No new files found since last transcription" are all exit status 0, matching the Unix tradition.


TCDump is Copyright Elf M. Sternberg (c) 2019, and licensed with the Mozilla Public License vers. 2.0. A copy of the license file is included in this folder.