The Ultimate Guide to Optimizing YouTube Closed Captions for E-commerce [Step-by-Step Tutorial]

The Ultimate Guide to Optimizing YouTube Closed Captions for E-commerce [Step-by-Step Tutorial]

Share on

The emergence of YouTube as an e-commerce selling tool has been well documented on ReelSEO as far back as 2010 with “Video E-Commerce and Video SEO for Retailers. Many experts have blogged about using the power of YouTube to promote products and burnish brands. My employer, Transamerican Auto Parts, already took its YouTube marketing strategy seriously as a tool to bring more conversions from web shoppers visiting our site, When I came on as an SEO Specialist, I pushed the company to take its YouTube strategy seriously as an SEO tool to bring more web surfers to the website in the first place. The company agreed and the results have been positive and quantifiable.

The video department was consistently following all the proven techniques for how to shoot and edit video for YouTube. What was not yet being adopted into the workflow was optimizing the videos once they were uploaded.

In this post, I provide a tutorial with step-by-step instructions and example files, of how to use just one powerful tool to help optimize YouTube videos, Closed Captions.

Background recently took on a distributor with a large share of its business selling jeep winches located in South America. Due to road conditions Jeep winches are popular in that region, especially waterproof winches because of the tropical climate. The distributor asked if 4WD could make its product videos available in Spanish. first gathered quotes from full service solutions like VIA and Transperfect that perform every step from transcription, translation, and overdub to uploading the Spanish language version of the video.

As those solutions were being considered, YouTube announced a new service for automating video caption translation. 4WD decided that experimenting with this new YouTube functionality would be a great way to ease into making videos available in Spanish. This would allow to see how much of a quantifiable difference the translations made in distributor sales.

Why Add Captions?

  • Captions are searchable and increase SEO value.
  • Captions allow users to watch your video at work or other places where having the sound on might not be appropriate.
  • With YouTube’s built in translator, captions files can be added in English and then translated to other languages at the video viewer’s request.

Note: In this tutorial, we’re going to show you the proper method to creating a timed-text .SBV (Superbase Form Definition File) closed caption file for YouTube.  With YouTube’s auto-transcript feature, you can also upload a plain text version of your transcript which YouTube will then use to create a time-coded closed captions file based off their speech-to-text recognition technologies.  This is a simple way to add closed captions to your YouTube videos but keep in mind that just like machine transcription, it’s nowhere near perfect.  If you want to really optimize your closed captions for search, you’ll want to create and optimize your own SBV closed caption file as we’ll show you below.

Tutorial Materials and Instructions

The example files, including the example video “Smittybilt – X20 10,000 Pound Winch – Jeep Winches & Recovery” are downloadable from this Google Docs folder.

  • Log into your own YouTube channel.
  • Upload the example video to your channel.
  • You will need to stay logged in while completing the closed caption steps.

Adding Captions

Make sure you are logged in to your YouTube channel to complete the steps below using the example video.

Step 1 – Transcribe The Video Onto A Word Processing File

YouTube allows you to skip this step by using YouTube’s “machine transcription.” I would not advise using it. If you have music in your video, if the video has dialogue that takes place outside and the people speaking are not individually mic’d, if people speak over one another, if the people speaking are small children, if the people speaking talk even a tiny bit fast, the machine transcription is going to deliver wildly unreliable results. In fact, it is kind of a fun parlor trick to use the machine transcription just to laugh at what it claims the video dialogue is.

  1. Go to Smittybilt – X20 10,000 Pound Winch – Jeep Winches & Recovery video.
  2. Open any word processing file.
  3. Transcribe the dialogue from the video into the word processing file.
  4. When transcribing the video, separate each idea with a carriage return/Enter. This will make it easier to cut and paste the transcribed text into the SVB captioning template.
    Note: Word For Word Transcription Is Not Always Necessary.  It is not necessary to transcribe the dialogue word for word. The way people speak does not always look correct on the written page.
  5. Correct the grammar.
  6. Break up contractions. “You’ll love the new X20 because with it you can’t go wrong” should be transcribed to “You will love the new X20 because with it you cannot go wrong.” Breaking up contractions will help the Google Translator translate the caption file into other languages.
  7. Break up compound sentences. “If you frequently go off-roading in the snow, then the X20 winch is a must-have piece of equipment” should be broken up into “Do you frequently go off-roading in the snow? Then the X20 winch is a must-have piece of equipment.” This will also help Google Translator translate the caption file into other languages.
  8. Do not skip transcribing any jokes or any personal points made by the speaker. These make watching/reading the videos more engaging.
  9. Add keywords wherever it makes sense. Take the sentence above as an example. Replace “the X20 winch is a must have piece of equipment” with “the X20 winch is a must-have Jeep accessory.”

Use the product description that is on your e-commerce website as a guide to show you which product features are being highlighted. If the video you are transcribing is long, here is what I would suggest. Don’t bother transcribing the entire video. Watch the video once so you can see which product features are being highlighted at what point in the video. Go to the product description page and copy the product features into the word processing file you are using to transcribe the video.

Step 2 – Add Transcription Text To Template .SBV Caption File

Open the YouTubeSubtitleTemplate.txt file. WebDev-il did an excellent job of explaining what you should notice about the file:

“You’ll notice the first line of each new caption has the start time and end time. These times are separated by a comma (no spaces) in the format H:MM:SS.000 with milliseconds after the decimal point. This is followed by a line break and then the text (each line on a new line). A blank line (2 line breaks) indicates the end of the caption and the start of the next time code. This is very important. If you miss this blank line then YouTube doesn’t seem to understand your SBV file.”

The first thing that the speaker says in the example video is “Hello, Cole here”. Then he runs right into “Today I want to show you Smittybilt’s X20 ten thousand pound winch.” The dialogue starts at 0:03 in the video and the word “winch” ends at 0:07.

So in the template the first line should be changed from


The second line should be changed from

“Hi, it's Craig here with 4 Wheel Drive Hardware (”
“Hello Cole here from”

The third line should be changed from


The fourth line should be changed from

“4 Wheel Drive Hardware ( has been in business since 1976.”
“Today I want to show you Smittybilt’s X20 ten thousand pound winch.”

Now why start the subtitles at 00:00:02.000 if the dialogue actually started at 0:03 in the video? Why add “from”?

We want the viewer to read the entire name of the winch, which is long. So we are going to give the sentence with the name of the winch its own space on the screen. Since the captions are searchable, we should try, whenever it makes sense, to add the name/website of the company at (at least) the beginning and end of the caption file. Any time the company is mentioned, we should type the name of the company and the website address in parenthesis after as it is in the template file:

“4 Wheel Drive Hardware ( has been in business since 1976.”

Keep the captions reasonably close to the time frame in which the words are spoken, but there is no need to be a stickler about it. In the video, from 0:34 to 0:43 the speaker actually says:

“So if your favorite trails have water crossing and mud pits or you frequently go wheeling in the snow or salt water, the Smittybilt X20 is a must have piece of equipment for your rig.”

Open the SmittybiltX20CaptionFile.SBV document to see how the dialogue was cut and pasted from the transcription file to the caption template.

Rather than crowding the screen with all the text of the quote above and potentially getting in the way of images of the X20 winch, I broke the sentence up. The breakup of the complex sentence also helps ensure better results from Google Translator.


Do your favorite trails have water crossings and mud pits?


Do you frequently go wheeling in the snow or salt water?


The Smittybilt X20 winch is a must-have piece of equipment for you rig.

Even though I broke up the sentence, I kept the captions in the overall time frame within which the sentence was actually spoken, even if the captions do not match up chronologically with the phrases spoken by the speaker.

Let’s say you are captioning a long video and you have chosen to skip transcribing the dialogue and just copy (and hopefully de-duplicate) the description from your e-commerce product page. Watch the video and make sure that you place the sentences describing the product features so that they line up with the screen images demonstrating those product features.

Step 3 – Upload Caption File

  1. Login to your YouTube account and go to
  2. Find the Smittybilt-X20 10,000 Pound Winch video and hit the EDIT button.
  3. On the video editing page you will see tabs at the top. One of the tabs will be CAPTIONS. Hit that tab.

  1. There will be two buttons to the right of the screen. The first button will be a blue one titled “Request Translation.” The second one will be a gray button titled “Upload caption on transcript file.” Choose the gray button.
  2. A screen will come up with a BROWSE button at the bottom. Click the button and select the SmittybiltX20CaptionFile.SBV document that you just downloaded from the example folder.
  3. Once the document is uploaded, another screen will come up that will look like this.

  1. Underneath the title of the document you just browsed through, there will be two radial selections available. The first one will read “Caption File,” the second one will read “Transcript File.” Select “Caption File.”
  2. In the Track name field type the full name of the product featured in the video. In this case “Smittybilt X20 10K LB Winch”
  3. Click the blue UPLOAD button.

Step 4 – Check Caption File Against The Video

  1. The next screen to come up will be the “Active tracks” screen, which will look like the image below. Select the name of the caption file you just uploaded and click it.

  1. A drop down dialogue box will appear with the captions in it separated by time stamps. Click the play button on the video. The captions in the dialogue box will turn bold as the video progresses. On the video player you will see the captions overlayed on the screen.

  1. Check to make sure that no individual caption is so wordy that it crowds out important images on the screen.
  2. Check to make sure that the amount of time the caption is on the screen is sufficient to read the entire caption.
  3. If issues are discovered, edit the SmittybiltX20CaptionFile.SBV, reupload it and check the captions again
  4. If no issues are discovered, click the blue “Done button”.

Step 5 – Translate the caption file

  1. On the video player at the bottom you will see a box with “cc” written in it for “closed caption”. The box will be red when the captions are on and gray when the captions are off.
  2. Click the “cc” box. A dialogue box will pop up. In the dialogue box select “Translate Captions”.

  1. Another dialogue box will pop up with a drop down menu of languages. Scroll down to “Spanish – Espanol” and click “ok”.

  1. When the video resumes play, the captions will be in Spanish as translated by the Google Translator.
  2. Check to make sure that no individual Spanish caption is so wordy that it crowds out important images on the screen.
  3. Check to make sure that the amount of time the Spanish caption is on the screen is sufficient to read the entire caption.
  4. If issues are discovered, edit the SmittybiltX20CaptionFile.SBV, reupload it and check the captions again.
  5. If no issues are discovered log out. You are now finished captioning the video.

Editing The Caption File

If some of the captions are too wordy or are crowding out important images on the bottom of the screen, there are some things to consider.

Idioms in language have two problems. They may take a lot of text to write out in another language, and they don’t translate well. For example, the U.S. baseball idiom “hit a home run” could wind up being “strike a house that is moving away quickly” in the translated language. It would be best to replace idioms in the dialogue with simpler language in the transcription file.

When making references to popular culture, use quotes. You don’t want to turn the U.S. television show titles “Homeland Security” and “Modern Family” into “Security of the nation where you were born” and “The family that is up to date” in some other language.

Important points to reiterate

  • The captions are not automatic. The viewer has to choose to see the captions. The viewer also has to click the closed caption box and choose to have the captions translated to another language.
  • The captions are translated by an algorithm. That is why it is so important that the transcription text have correct grammar, that contractions be broken up, and that complex sentences be broken up.
  • The caption file can be downloaded from the video edit page. So it is not necessary to save the caption file to one’s hard drive.
  • The captions need not exactly follow the dialogue. It is just important that they get across the main features of the product being promoted in the video.

Here’s the final product video, “Smittybilt – X20 10,000 Pound Winch – Jeep Winches & Recovery” with closed captions.  Check it out.


Video Industry

Share on

Read More Insights

©2021 Tubular Insights & Tubular Labs, Inc.