Embracing Podcast Stories with AI and Python

Apr 10, 2025

Embracing Podcast Stories with AI and Python

Okay, folks, buckle up for a tale of tech woe and (hopefully) AI-powered redemption. I was seeking a final project for my final project for Kaggle’s 5-Day AI Intensive. All I kept thinking of is my long-delayed project to explore our mountain of podcast mp3s and transcripts.

Here at Rethink Next Labs we’re a tech company that, ironically, has a knack for creating truly epic tech storage and processing messes. Case in point: our podcast archive.

We’re sitting on a glorious mountain of 100 MP3 files, the digital echoes of nine years of podcast interviews across three different shows. Think of it as our own personal Library of Alexandria, except instead of scrolls, it’s a digital pile of audio. Our latest baby, the Creative Innovators podcast (seriously, give it a listen!), has only added to the fun, blessing us with three seasons of awesome chats and a delightful smorgasbord of transcriptions. You get the picture – it’s a lot.

For ages, our brilliant solution to this audio overload has been… well, throwing money at it. Transcription services, summarization wizards – they’ve all had a go. Wouldn’t it be helpful to have our own AI brain that could just talk to all this amazing content? Imagine asking it about recurring themes or the secret sauce of innovation, gleaned from years of conversations!

So, my goal? To turn this audio chaos into something super useful. We’re talking consistent transcripts (finally!), killer summaries, and the holy grail: clear career paths for every guest. Think cool infographics, maybe even an ebook and audiobook packed with career wisdom, all mined from the source. And this Kaggle AI Intensive? It felt like the universe saying, “Hey, your audio mess? Perfect final project material!”

Our “Project Pathway: Audio to Patterns and Interaction with Structured Career Path Knowledge” is basically our plan to drag ourselves into the 21st century (audio-wise, at least). We’re starting small, with four brave sample MP3s from Creative Innovators. Here’s the techy lowdown:

  • Herding the Digital Cats: Getting Python to actually see our four test .mp3 files and treat them like the valuable data they (sort of) are.
  • Function Fiesta: A whole bunch of Python functions getting ready to hit up the APIs for Whisper, Gemini, and whoever else will listen.
  • MP3 to WAV: The Great Conversion: Using PyDub and AudioSegment to turn our trusty .mp3s into .wavs. Apparently, it helps with chopping them up for the AI to munch on. Go figure.
  • Whisper Tiny’s Fast Chat: Using OpenAI’s “tiny” Whisper for quick transcriptions. Speed over perfect accuracy for now, even if it occasionally sounds like our guests are speaking in tongues.
  • Gemini’s Brainy Bits: Letting Gemini AI loose on the transcripts to pull out the key takeaways in three neat little bullet points. Fingers crossed it gets the good stuff.
  • Prompting for Career Gold: Basically, teaching Gemini how to be a career path detective, digging through the transcripts for those pivotal moments.
  • Career Path Unlocked!: Getting Gemini to actually map out those career journeys in a way that makes sense (even to us!).
  • SQLite’s Secret Stash: Dumping all this processed goodness into an SQLite database. Gotta keep things tidy, even if it’s just digital tidiness.
  • Visual Extravaganza (Planned): Dreaming of using Plotly, Wordcloud, Seaborn, and Graphviz to turn boring data into pretty pictures. Think career timelines that don’t make your eyes glaze over!
  • Sharing is Caring (Eventually): Making sure all this hard work can play nice with other AI tools and our own systems. No digital islands allowed!

Now, being the tech-savvy folks we are (ahem), this journey hasn’t been without its… learning curves. My personal coding skills are best described as “enthusiastic amateur.” Last year, I learned C# to be able to work in Unity, but otherwise code in HTML and back in the day with Fortran punchcards. So, yeah, a lot of this code is lovingly borrowed and Frankensteined together with help from Gemini, my awesome NotebookLM sidekick, and the ever-patient ChatGPT.

But every tech stumble is a chance to learn, right? And the potential here is genuinely cool. Imagine researchers finding hidden patterns in how people tell their stories, marketers visualizing customer journeys, or us just being able to ask our AI brain, “Hey, what are the common threads in how our most innovative guests built their careers?” That’s the dream! And doing it ourselves means we get to build it our way, quirks and all.

The future’s looking bright (and hopefully filled with fewer audio-related headaches). This Kaggle AI Intensive project is just the first step in our quest to tame the podcast beast and finally bring our audio archive into the AI age. Stay tuned for more tales of tech triumphs (and likely a few more coffee-augmented mistakes along the way).

Tidbits and Updates

Screen x Screen Online Tools – Tidbits and Notes

Screen x Screen Online Tools – Tidbits and Notes

Thanks for joining today's session at the ScreenxScreen Virtual Conference on Online Tools.  Online Tools are important and potentially both beneficial and distracting for artists and those who create engagement and content with them.    Background Links and...

read more
MONDO.NYC

MONDO.NYC

Upcoming Events MONDO.NYC October 13-16, 2020 - ONLINECelebrating its fifth Anniversary, #MondoNYC is a four-day global interactive meeting and livestream conference and showcase festival, a vital pipeline of information, connectivity, and curation of great new music,...

read more
Creating Fearlessly in Virtual Reality

Creating Fearlessly in Virtual Reality

Please enjoy the "Innovating Music" podcast that our Executive Director produced in association with her role at the UCLA Center for Music Innovation at the Herb Alpert School of Music. This Episode: How do you create new and...

read more
Change Stories: Change How We Work and Decide

Change Stories: Change How We Work and Decide

We enjoyed sharing thoughts with the US Housing and Urban Development (HUD) OCIO Learning Series.  This session was recorded in January and ran as a webinar on March 17.  You can find it now at http://portal.hud.gov/hudportal/HUD?src=/press/multimedia/videos and can...

read more
Reflecting on My SXSW Journeys

Reflecting on My SXSW Journeys

I spoke at SXSW Music again this year on my current favorite topic: Music 20/20 and how we can proactively affect the future. SXSW, however, is not just about speaking. It is about diving deeply into diverse ideas with diverse people. It is one of my annual...

read more
Listening Harder to Me from My Past

Listening Harder to Me from My Past

About this time each year, I look at my stuff. Goodwill Industries gets a lot of my physical stuff, and gets a lot more this year as two of my three kids are ensconced in colleges not in this same town. My third got the last of her college applications out yesterday...

read more