Subject Guides: Data Processing and Analysis: Transcribe audio and video

Transcribe audio and video

Interviews are usually documented in the form of video or audio. The library offers workshops and guidance in tools to transcribe interviews and to analyse the transcription.

Courses in transcribing

oTranscribe

oTranscribe
Transcribe audio and video (manually) more efficiently with keyboard shortcuts and without having to switch between several windows.

ELAN

ELAN is an open-source transcription tool for audio and video where you can use tiers for different speakers or for speech and gestures. It allows for segmentation and registration of silences and is a good option for e.g. conversation analysis and analysis of sign language.

ELAN (EUDICO Linguistic Annotator)

Automatic transcription

Interviews contain personal data, which limits which software is appropriate to use to transcribe the recording.

If the interview material does not contain sensitive personal data, a suitable transcription tool is available in Canvas Studio. It is available to Uppsala University students, researchers and staff under the Studio tab when you are logged into Studium. The tool is intended for captioning video but can also be used to transcribe audio and video. Keep in mind that the automatic transcription needs to be edited manually as the result of automatic transcription is not flawless. The transcript will be in a subtitle format, see instructions below for how to remove everything but the actual text.

There is currently no procured software for transcribing data containing sensitive personal data for Uppsala University. Researchers can apply for projects at UPPMAX and within that project use OpenAI Whisper, free of charge. As long as Whisper is used on Bianca (NAISS SENS) at UPPMAX, it can be used for sensitive data. For non-sensitive data, Snowy on UPPMAX can be used. No web based versions of Whisper are approved for transcribing interviews.

It is also possible for researchers to purchase the Sunet speech-to-text service directly from Amberscript provided that they also sign a personal data processing agreement (DPA/PUBA) with them.

How to clean an SRT file from Canvas Studio
A step by step tutorial for removing time stamps, line numbers etc from your transcript file.