jackandahat: A brown otter, no text. (Default)
Jack ([personal profile] jackandahat) wrote in [community profile] accessibility_fail 2011-05-07 08:41 am

YouTube, bless them.

They're trying. They're really trying. But after yet another person went on about how autocaptioning was "close enough", I grabbed the last video I'd had open and took a look at just how close the autocaptioning was to what I could hear. (Disclaimer - I am hard of hearing, but I lipread pretty well, so I think I know what they were saying.)

15 errors in 25 seconds of video.

Several of these totally changed the plot; some were just damn silly.

Expanded on here, now with bonus Cosby Show!

[personal profile] teafeather 2011-05-07 11:25 am (UTC)(link)
I was really excited about auto-captions when they first came out, as I have auditory processing problems. Unfortunately most of the captions are so bad that they leave me feeling even more confused about what's being said and interfere with the bits I can hear clearly.
codeman38: Osaka from Azumanga Daioh questioning whether Apple's 'think different' slogan should be 'differently'. (think different)

[personal profile] codeman38 2011-05-07 03:03 pm (UTC)(link)
Yeah, same here! Pretty much every video I see people link that only has auto-captions is practically incomprehensible, so much so that even my dodgy auditory processing can make more sense of the audio than the captions do.

Personally, I'm amazed more people haven't taken advantage of the auto-timing feature, where YouTube automatically adds timestamps to a straight transcript. That works surprisingly well, in fact-- because matching already-transcribed words to the audio is a much more tractable problem than making sense of audio with no hints at all.
sophie: A cartoon-like representation of a girl standing on a hill, with brown hair, blue eyes, a flowery top, and blue skirt. ☀ (Default)

[personal profile] sophie 2011-06-08 07:25 am (UTC)(link)
I know I've definitely used it on one of my videos so far, and yes, it is pretty good :)
maewyn: a middle-aged obi-wan holds his goatee, looking worried and pensive (Default)

[personal profile] maewyn 2011-05-07 02:38 pm (UTC)(link)
I haven't done anything with autocaptioning, but I work as a closed captioner, and the number of times I have to back up the video and re-listen to figure out WTF people are saying... I am not surprised there are errors in the autocaptioning. I'd also guess that the error rate would go up depending on how many speakers there are and whether they're using a specialized vocabulary (like on a home renovation show).
codeman38: Osaka from Azumanga Daioh, with a speech bubble reading 'Contemplation No. 1'. (contemplation)

[personal profile] codeman38 2011-05-07 03:04 pm (UTC)(link)
A few times, I've actually quoted the auto-captioning back at people just to give them an idea of how incomprehensible it is. It almost always comes as a surprise to them. (I mean, seriously. Do they not actually bother to try to watch things with captioning?)
codeman38: Osaka from Azumanga Daioh surrounded by Japanese kana, translated as 'Get it together!' (get it together)

[personal profile] codeman38 2011-05-07 03:07 pm (UTC)(link)
ARGH YES YES YES.

If what I hear is "mumble mumble donut shop mumble" and the captions include "don't shout" where I heard "donut shop", that obviously means I could make out the rest of the dialogue, right? >_<
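For anyone who wants to put a number on "close enough", here is a rough sketch of counting caption errors word by word. It uses a plain word-level edit distance; the function names `word_errors` and `word_error_rate` are made up for this example, and the only sample text is the "donut shop" / "don't shout" pair above. It assumes you already have a correct transcript to compare against, which is of course the hard part.

```python
def word_errors(reference: str, hypothesis: str) -> int:
    """Count the fewest word substitutions, insertions, and deletions
    needed to turn the caption (hypothesis) into what was said (reference)."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    # prev[j] holds the distance between the reference words processed
    # so far and the first j caption words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word missing from caption
                curr[j - 1] + 1,     # extra word in caption
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Errors divided by the number of words actually spoken."""
    spoken = len(reference.split())
    if spoken == 0:
        raise ValueError("reference transcript is empty")
    return word_errors(reference, hypothesis) / spoken


# Hypothetical usage with the example from this comment:
errors = word_errors("donut shop", "don't shout")
rate = word_error_rate("donut shop", "don't shout")
print(errors, rate)
```

Both words come out wrong, so that one mishearing counts as two errors out of two words spoken. A count like this says nothing about whether an error changes the meaning, which is the part that actually matters when you are relying on the captions.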
codeman38: Osaka from Azumanga Daioh surrounded by Japanese kana, translated as 'Get it together!' (get it together)

[personal profile] codeman38 2011-05-07 03:38 pm (UTC)(link)
I had no idea, but I would never have gotten the correct transcript that you gave in your journal out of that!
maewyn: a middle-aged obi-wan holds his goatee, looking worried and pensive (Default)

[personal profile] maewyn 2011-05-07 03:14 pm (UTC)(link)
Oh, I totally get you and I'm not defending it. I meant that specialized vocab would be even worse.

With the sheer number of videos on YouTube, it'd cost buckets of money to hire a real person to caption them all. Somebody to *correct* all those auto errors, however, would do some real good.
maewyn: a middle-aged obi-wan holds his goatee, looking worried and pensive (Default)

[personal profile] maewyn 2011-05-07 03:30 pm (UTC)(link)
I am a little horrified that anyone could think you'd get accurate captioning just from software! Have these people spent any time at all with voice recognition software? Do they know how you have to speak just right, and train the software to your voice, and how the speaker(s) in your average YouTube video are doing neither of these things?

It could be awesome if users could report captioning errors, or even just flag a video in general for review of the captions, and then have someone go in to fix the errors. (I can dream, right?)
kyrielle: Middle-aged woman in profile, black and white, looking left, with a scarf around her neck and a white background (Default)

[personal profile] kyrielle 2011-05-07 04:51 pm (UTC)(link)
What would be great would be if someone watching a video and spotting captioning issues could provide an alternate transcript that would be stored for review by someone (so people couldn't, as mentioned above, put swear words on kiddie programs), and released once reviewed.
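A minimal sketch of that "store it, review it, then release it" flow, just to show the idea is not complicated on paper. Everything here is hypothetical (the `CaptionReviewQueue` class, the field names, the video ID); it is not based on anything YouTube actually offers.

```python
from dataclasses import dataclass, field
from enum import Enum


class Status(Enum):
    PENDING = "pending"
    APPROVED = "approved"
    REJECTED = "rejected"


@dataclass
class Submission:
    video_id: str
    submitter: str
    transcript: str
    status: Status = Status.PENDING


@dataclass
class CaptionReviewQueue:
    submissions: list = field(default_factory=list)
    # video_id -> transcript that viewers actually see
    published: dict = field(default_factory=dict)

    def submit(self, video_id: str, submitter: str, transcript: str) -> Submission:
        """Store a viewer's alternate transcript; nothing is shown yet."""
        submission = Submission(video_id, submitter, transcript)
        self.submissions.append(submission)
        return submission

    def pending(self) -> list:
        """Everything still waiting for a reviewer."""
        return [s for s in self.submissions if s.status is Status.PENDING]

    def review(self, submission: Submission, approve: bool) -> None:
        """Release the transcript only if the reviewer approves it."""
        if submission.status is not Status.PENDING:
            raise ValueError("submission was already reviewed")
        if approve:
            submission.status = Status.APPROVED
            self.published[submission.video_id] = submission.transcript
        else:
            submission.status = Status.REJECTED


# Hypothetical usage:
queue = CaptionReviewQueue()
fix = queue.submit("example-video", "some_viewer", "donut shop")
print(queue.published)   # still empty until someone reviews it
queue.review(fix, approve=True)
print(queue.published)
```

The sketch leaves out the hard part, which is who does the reviewing and how they keep up with the volume.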
codeman38: Osaka from Azumanga Daioh: 'I live in my own little world, but it's OK... they know me here!' (own little world)

[personal profile] codeman38 2011-05-07 03:37 pm (UTC)(link)
The obvious solution to that would be moderation, so that the video producer could outsource the captions but would have to give the final OK for them to show up.

But of course, Google wouldn't bother to implement something like that, now, would they?

[personal profile] treeowl 2011-05-08 04:19 am (UTC)(link)
They would need a lot more infrastructure than you're suggesting. How would the moderators be chosen? There would need to be an awful lot of moderators. How would they prevent the knuckle-draggers from overwhelming the moderators with a flood of crud?
codeman38: Osaka from Azumanga Daioh surrounded by Japanese kana, translated as 'Get it together!' (get it together)

[personal profile] codeman38 2011-05-08 05:13 am (UTC)(link)
Ah, good point. I was thinking it could be handled just like comments, with the video editor doing all the moderation... but then I consider just how frustrating it is to have to delete horrible comments. -_-;;

[personal profile] treeowl 2011-05-08 06:45 am (UTC)(link)
It wouldn't be bad to offer that as an option to the video uploader, I suppose, but I'm not sure it's the best way to handle it.
codeman38: Osaka from Azumanga Daioh: 'I live in my own little world, but it's OK... they know me here!' (Osaka)

[personal profile] codeman38 2011-05-07 02:56 pm (UTC)(link)
It's also greatly affected by how much background noise there is. The voice-recognition system is trained for complete silence, and so of course it's completely thrown off if there's even the slightest bit of music in the background.
codeman38: Osaka from Azumanga Daioh: 'I live in my own little world, but it's OK... they know me here!' (Osaka)

[personal profile] codeman38 2011-05-07 03:05 pm (UTC)(link)
Yep! C'est moi. In fact, I'm the one who reported the missing captions to them.
nafs: red dragon on lavender background - welsh or celtic style (Default)

[personal profile] nafs 2011-05-07 04:41 pm (UTC)(link)
Autocaptioning is still horrible on TV (seriously, the captioning on The Weather Network is so bad I sometimes suspect it's an actual person who is bored and figures no one is reading and so types nonsense to entertain themselves), so I don't really get how YouTube thought it was going to be better on the internet.
nafs: red dragon on lavender background - welsh or celtic style (Default)

[personal profile] nafs 2011-05-07 05:59 pm (UTC)(link)
I don't know if they do in the States, but on the Canadian one it's basically the only explanation I can think of.
mercredigirl: Buffy Summers holding a rocket launcher (Buffy + rocket launcher)

[personal profile] mercredigirl 2011-06-01 07:47 am (UTC)(link)
I just ran into this, because my brain has difficulty focusing on unsubtitled video, and I had to look at videos for work... I am appalled by the auto-captioning. I mean, if it doesn't work, why unveil it?
mercredigirl: Screencap of Twi'lek Jedi Aaylas'ecura from Star Wars, kissing. (Default)

[personal profile] mercredigirl 2011-06-01 07:49 am (UTC)(link)
Maybe they didn't think people would actually need to use it.
mercredigirl: Screencap of Twi'lek Jedi Aaylas'ecura from Star Wars, kissing. (Default)

[personal profile] mercredigirl 2011-06-01 07:54 am (UTC)(link)
I know transcribing takes spoons, so I don't want to ask everyone for transcripts, but I do wish that - for media corporation products or whatever, at least - transcripts could be made available. :(