Dense Captioning of Video Demonstrating the Upgraded Boston Dynamics Atlas Robot

Glen Tickle
July 6, 2016

Artist and programmer Gene Kogan ran the Boston Dynamics video demonstrating its upgraded Atlas robot through the DenseCap captioning system, which tries to identify and describe objects in each frame. The system is impressive but at times wildly inaccurate: in the resulting video it labels the robot as a variety of incorrect things, including a person skiing, a motorcycle, and a fire hydrant.

Captions are generated by densecap on individual video frames. The video is made by a python script which merges matching captions along sequences of consecutive frames with a set of (mostly greedy) heuristics. Presumably, it would be possible to caption sequences of regions directly rather than a naive merging algorithm, but I’m not sure how :)
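
One way to picture the merging step described above is a greedy pass that extends a caption's span whenever the same text reappears in the next frame in roughly the same place. The sketch below is only an illustration and not Kogan's actual script: the detection format (a caption string plus a bounding box), the names `merge_captions`, `iou`, and `Track`, and the 0.5 overlap threshold are all assumptions.

```python
from dataclasses import dataclass, field


def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


@dataclass
class Track:
    """A caption that persists over a run of consecutive frames (hypothetical type)."""
    caption: str
    start: int  # index of the first frame in the run
    end: int    # index of the last frame in the run
    boxes: list = field(default_factory=list)


def merge_captions(frames, min_iou=0.5):
    """Greedily merge identical captions across consecutive frames.

    `frames` is a list with one entry per video frame; each entry is a list of
    (caption, box) pairs. This input format is assumed for illustration.
    """
    finished, active = [], []
    for t, detections in enumerate(frames):
        unmatched = list(detections)
        still_active = []
        for track in active:
            # Greedy choice: accept the first detection with the same text
            # that overlaps the track's most recent box enough.
            match = next(
                (d for d in unmatched
                 if d[0] == track.caption and iou(d[1], track.boxes[-1]) >= min_iou),
                None,
            )
            if match is None:
                finished.append(track)  # the caption disappeared, close the run
            else:
                unmatched.remove(match)
                track.end = t
                track.boxes.append(match[1])
                still_active.append(track)
        # Any detection not absorbed by an existing track starts a new one.
        for caption, box in unmatched:
            still_active.append(Track(caption, t, t, [box]))
        active = still_active
    finished.extend(active)
    return finished
```

Because each track takes the first acceptable match rather than the best one overall, this is fast but can mislink regions when several similar captions overlap, which is the kind of limitation the "naive merging" remark points at.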

interestingly, densecap never mentions robots. atlas is variously described as person, motorcycle, fire hydrant, etc pic.twitter.com/roInNdoKKM

— Gene Kogan (@genekogan) July 1, 2016

via Prosthetic Knowledge

