How to Use Descript for Podcast Editing: AI-Powered Audio Workflow
I've been using Descript for over six months now, and I can confidently say it's transformed how I approach podcast editing. If you're tired of wrestling with traditional DAWs like Audacity or Logic Pro, Descript's AI-powered workflow will feel like a revelation. In this tutorial, I'll walk you through my exact process for editing a 30-minute podcast episode in under an hour.
What You'll Need
- A Descript account (free tier works, but I recommend the Pro plan for transcription hours)
- A recorded podcast episode (I'll use a WAV file for this example)
- Basic familiarity with audio editing concepts
Step 1: Import and Transcribe Your Audio
First, let's get your audio into Descript. I typically record with my co-host on Zoom, but you can import any audio file.
- Open Descript and click New Project.
- Name your project (I use "Episode 47 - AI in Healthcare").
- Drag your audio file into the project window or click Import.

Once imported, Descript automatically transcribes your audio. This takes about 2–3 minutes for a 30-minute episode. The transcription accuracy is impressive—I'd say 95%+ for clean recordings.
Pro Tip: If you have multiple speakers, Descript automatically detects them. But I recommend manually labeling each speaker in the transcript for better accuracy. Click on a speaker label and rename it (e.g., "John" or "Guest").
Step 2: Edit by Deleting Text, Not Audio
This is where Descript shines. Instead of cutting waveforms, you edit the transcript like a word processor.
- Remove filler words: Click the Filler Words button in the toolbar. Descript highlights all "ums," "uhs," "you knows," and other fillers.
- Click Remove All to delete them instantly. Descript automatically removes the corresponding audio and smooths the remaining clips.

- Delete sections: Just highlight text in the transcript and press Delete. The audio disappears. For example, if you have a rambling tangent, select it and hit delete.
Common Pitfall: Removing filler words can sometimes create awkward pauses or cut off the beginning of the next word. Always listen to the result before moving on. Use the Undo button (Ctrl+Z) if needed.
Step 3: Use AI to Remove Dead Air and Pauses
Silence kills podcast momentum. Descript has a brilliant tool for this.
- Select the Silence tool from the left panel.
- Set your threshold—I use 0.5 seconds for natural-sounding edits.
- Click Remove All to delete pauses longer than that.

Pro Tip: Don't remove all silence! Natural breathing pauses are essential for conversational flow. I recommend keeping pauses under 1 second for a tight edit, but leave some for authenticity.
Step 4: Fix Mistakes with the "Regenerate" Feature
Ever stumble over a word or mispronounce something? Descript's AI can fix that.
- Find the mistake in the transcript.
- Highlight the incorrect word or phrase.
- Click the Regenerate button (the magic wand icon).
- Type the correct word in the popup.
- Descript generates new audio that matches your voice. It's uncanny how good this is.

I used this recently when I mispronounced "algorithm" as "al-go-rith-im." Descript fixed it seamlessly.
Common Pitfall: This works best for short corrections (1–3 words). For longer sections, the AI voice can sound slightly robotic. Use sparingly for natural results.
Step 5: Add Music and Sound Effects
Descript has a built-in library of royalty-free music and sound effects.
- Click the Media tab on the left.
- Browse or search for a track (I use "Upbeat Technology" for intros).
- Drag it into the timeline below your voice track.

Pro Tip: Use the Auto-Duck feature. Right-click the music track and select Auto-Duck. Descript automatically lowers the volume when someone speaks and raises it during pauses. This saves hours of manual volume automation.
Step 6: Polish with Audio Effects
Descript has decent built-in audio processing.
- Select your voice track.
- Click the Audio tab in the right panel.
- Apply Noise Reduction (I use 50% for room echo).
- Add Compression (set to "Voice" preset for consistency).
- Apply a High-Pass Filter at 80Hz to remove low rumble.

Common Pitfall: Don't over-process. Too much noise reduction creates a "tinny" sound. I keep it subtle—just enough to clean up background hum.
Step 7: Export Your Final Episode
You're almost done! Let's get this out into the world.
- Click File > Export.
- Choose Audio format.
- Select WAV for highest quality (or MP3 if you need smaller files).
- Set sample rate to 48kHz and bit depth to 24-bit for broadcast quality.
- Click Export.

Pro Tip: Descript also supports direct export to hosting platforms like Buzzsprout or Libsyn. Check the Publish tab for integrations.
Advanced Workflow: My Personal System
After months of use, here's my optimized process for a 30-minute episode:
- Import and transcribe (5 minutes)
- Remove filler words (2 minutes)
- Listen at 1.5x speed and delete tangents (15 minutes)
- Remove long silences (2 minutes)
- Fix major mistakes with Regenerate (5 minutes)
- Add intro/outro music with Auto-Duck (5 minutes)
- Apply audio effects (3 minutes)
- Final listen at 1x speed (30 minutes)
- Export (2 minutes)
Total: About 1 hour for a polished episode.
Conclusion
Descript has fundamentally changed how I edit podcasts. The AI-powered workflow—especially transcription-based editing and filler word removal—cuts my editing time by 60%. Here are the key takeaways:
- Edit text, not waveforms – This is the biggest time-saver. Deleting text removes audio seamlessly.
- Use AI tools sparingly – The Regenerate feature is amazing for small fixes but can sound artificial for longer sections.
- Auto-Duck is your best friend – It handles music ducking automatically, saving you hours of manual work.
- Always listen before exporting – Even with AI, human ears catch things software misses.
If you're still using traditional DAWs for podcast editing, give Descript a try. The free tier is generous enough to test it on a full episode. I promise you won't go back.
Happy editing!