Transcribing longer audio files

As lissyx said, we need more context.