There are two types of people in this world: those who watch TV shows with captions on, and those who are weird. All joking aside, the importance of closed captions for video cannot be understated. In addition to being crucial for the deaf and hard-of-hearing, captions also are important when audio is unavailable or not clearly audible. Maybe you're watching a video in a public place and the audio is drowned out by ambient noise. Or maybe the person speaking in the video is using a microphone that isn't the best quality, or speaks with an accent or dialect that is unfamiliar to the viewer. Captions are a good thing. Unfortunately, captioning audio in a live stream is tricky. always Before we dig into the problem of captioning live streams, let's talk about semantics a bit. Did you know that there is a difference between the terms and ? describes as: closed caption subtitle HTML spec subtitles transcription or translation of the dialogue, suitable for when the sound is available but not understood (e.g. because the user does not understand the language of the media resource's audio track). Overlaid on the video. The spec describes as: captions Transcription or translation of the dialogue, sound effects, relevant musical cues, and other relevant audio information, suitable for when sound is unavailable or not clearly audible (e.g. because it is muted, drowned-out by ambient noise, or because the user is deaf). Overlaid on the video; labeled as appropriate for the hard-of-hearing. This means that when we talk about "closed captions" for live videos, we're usually referring to since usually include descriptive information. Think about a scene in a TV show where an actor gets in the car to leave home and says goodbye to their spouse. The caption for this scene might read "Goodbye, dear. [car engine starts]." subtitles captions We're not close to having AI systems describe contextual information like this for us, so we're limited to adding pure "speech-to-text" subtitles captions to our live stream; we can do that using the method below. You’ll notice that the title and body of this blog post uses the terms ‘captions’ or ‘closed captions’ even though what we’re really talking about here are subtitles based on the definitions above. Unfortunately, since the term ‘closed captions’ is so commonly misused, it makes the most sense to use this term improperly to help developers find this blog post and learn how to add this feature to their live streams. Just know that what we’re really talking about here are subtitles! Note: Adding Captions to Amazon IVS Live Streams The solution that we look at in this post focuses on broadcasting to an Amazon Interactive Video Service (Amazon IVS) live stream from . OBS doesn't offer native support for captioning, but there are several plugins that can perform the necessary speech-to-text conversion and publish the captions to an RTMP stream in the . OBS Studio CEA-708/EIA-608 format supported by Amazon IVS For this demo, I've chosen to use the by ratwithacompiler ( and ). To get started with this plugin, and . Once you've got it installed in OBS, select and make sure the dock is enabled. OBS-captions-plugin GitHub plugin page download it install it Docks Captions Next, select the 'gear' icon in the dock to modify the settings. Captions Make sure that a is selected, and modify the plugin configuration to suit your needs. For example, the default for me was set to seconds, but I found seconds to be a better value. Caption Source Caption Timeout 15.0 5.0 Once you've saved your configuration and started a new live stream, the plugin handles converting your speech to text and produce the required caption information to the live stream. To play back the caption data with the Amazon IVS player, we can add an event listener to listen for the event ( ). TextCue docs ivsPlayer.addEventListener(IVSPlayer.PlayerEventType.TEXT_CUE, (evt) => {
  console.log(evt);
} The handler as configured above logs all incoming events to the console. TextCue The property of the event contains the caption data. text TextCue With some HTML and CSS, we can render the caption data as an overlay on the element. This implementation is highly dependent on your needs, but you should take into account auto-hiding the overlay after a specified period of no caption data. <video> https://youtu.be/spFpCIqGSm8?embedable=true Summary In this post, we looked at how to use an OBS plugin to convert speech to text and publish that text as caption data on an Amazon IVS live stream.

The code in this story is for educational purposes. The readers are solely responsible for whatever they build with it.

Walkthroughs, tutorials, guides, and tips. This story will teach you how to do something new or how to do something better.

How to Add Closed Captions (CC) to Amazon IVS Live Streams

About Author

Comments

TOPICS

THIS ARTICLE WAS FEATURED IN

Related Stories

Adding Pre-Recorded Video to Live Streams with Amazon IVS Web Broadcast SDK

How to Broadcast to Your Amazon IVS Live Stream From a Browser

The Noonification: Watch Web 2.5 Documentary Trailer (5/23/2023)

69 Stories To Learn About Live Streaming

Adding Pre-Recorded Video to Live Streams with Amazon IVS Web Broadcast SDK

A Quick Guide to LiveLike: How to Enhance Live Stream Interactivity

Adding Pre-Recorded Video to Live Streams with Amazon IVS Web Broadcast SDK

How to Broadcast to Your Amazon IVS Live Stream From a Browser

The Noonification: Watch Web 2.5 Documentary Trailer (5/23/2023)

69 Stories To Learn About Live Streaming

Adding Pre-Recorded Video to Live Streams with Amazon IVS Web Broadcast SDK

A Quick Guide to LiveLike: How to Enhance Live Stream Interactivity

Light-Mode

Classic

Newspaper

Minty

Dark-Mode

Neon Noir

Minty

HN StartUps