An illustration of a metalens designed for compact depth sensing. It consists of subwavelength-spaced square nanopillars. By alternating two different nanopillar patterns, visualized here in red and blue, this metalens forms two images at the same time. The two images mimic the images captured by the layered retinae in the eyes of jumping spiders. (Image courtesy of Qi Guo and Zhujun Shi/Harvard University)

Spider-inspired Depth Sensor with Microrobotics, AR, and Wearable Applications

By Nicolette Emmino On Oct 29, 2019

Researchers can extract a lot of valuable information from evolution. That’s what happened when a Harvard engineering team looked at jumping spiders. These small arachnids have impressive depth perception despite their tiny brains, allowing them to accurately pounce on unsuspecting targets from several body lengths away.

The researchers at the Harvard John A. Paulson School of Engineering and Applied Sciences (SEAS) have used what they extracted from the study of these spiders to develop a compact and efficient depth sensor that could be used on board microrobots, in small wearable devices, or in lightweight virtual and augmented reality headsets. The device combines a multifunctional, flat metalens with an ultra-efficient algorithm to measure depth in a single shot.

“Evolution has produced a wide variety of optical configurations and vision systems that are tailored to different purposes,” said Zhujun Shi, a Ph.D. candidate in the Department of Physics and co-first author of the paper. “Optical design and nanotechnology are finally allowing us to explore artificial depth sensors and other vision systems that are similarly diverse and effective.”

What’s Wrong With Current Depth Sensors?

Many of today’s depth sensors, such as those in phones, cars and video game consoles, use integrated light sources and multiple cameras to measure distance. Face ID on a smartphone, for example, uses thousands of laser dots to map the contours of the face. This works for large devices with room for batteries and fast computers, but what about small devices with limited power and computation, like smart watches or microrobots?

Humans measure depth using stereo vision, meaning that when we look at an object, each of our two eyes collects a slightly different image. Try this: hold a finger directly in front of your face and alternate opening and closing each of your eyes. See how your finger moves? Our brains take those two images, examine them pixel by pixel and, based on how the pixels shift, calculate the distance to the finger.

“That matching calculation, where you take two images and perform a search for the parts that correspond, is computationally burdensome,” said Todd Zickler, the William and Ami Kuan Danoff Professor of Electrical Engineering and Computer Science at SEAS and co-senior author of the study. “Humans have a nice, big brain for those computations but spiders don’t.”
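To see why that search is costly, here is a minimal sketch of stereo matching on a single row of pixels. It is an illustration only, not code from the study: the function names, the toy pixel values, the focal length and the baseline are all assumptions made for this example. For one pixel it tries every candidate shift and keeps the best match, then converts the shift to distance with the standard pinhole stereo relation.

```python
def match_disparity(left, right, x, half_window, max_disparity):
    """Return the shift d that best aligns a window around left[x] with right[x - d].

    Cost is the sum of absolute differences over the window (hypothetical helper).
    """
    best_d, best_cost = 0, float("inf")
    for d in range(max_disparity + 1):          # search over candidate shifts
        cost = 0
        for k in range(-half_window, half_window + 1):
            cost += abs(left[x + k] - right[x + k - d])
        if cost < best_cost:
            best_d, best_cost = d, cost
    return best_d


def depth_from_disparity(disparity_px, focal_length_px, baseline_m):
    """Pinhole stereo relation: depth = focal length * baseline / disparity."""
    return focal_length_px * baseline_m / disparity_px


# Toy scanline: the same intensity bump sits 3 pixels further left in the right image.
left = [0, 0, 0, 0, 0, 0, 1, 5, 9, 5, 1, 0, 0, 0, 0, 0]
right = left[3:] + [0, 0, 0]

d = match_disparity(left, right, x=8, half_window=2, max_disparity=5)
# Assumed camera values for illustration: 500-pixel focal length, 6 cm between views.
z = depth_from_disparity(d, focal_length_px=500.0, baseline_m=0.06)
print(d, z)
```

The inner search runs once per pixel, so the work grows with image size times the number of candidate shifts times the window size. That is the burden Zickler describes.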

Jumping spiders have evolved a more efficient system to measure depth. Each principal eye has a few semi-transparent retinae arranged in layers, and these retinae measure multiple images with different amounts of blur. For example, if a jumping spider looks at a fruit fly with one of its principal eyes, the fly will appear sharper in one retina’s image and blurrier in another. This change in blur encodes information about the distance to the fly.
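The link between blur and distance can be sketched with the textbook thin-lens model. This is an idealized illustration, not a model of a spider's eye or of the Harvard device, and every number below is an assumption chosen for the example. Two sensor planes behind one lens stand in for two retinal layers, and each object distance produces a different pair of blur sizes.

```python
def blur_diameter(object_dist, focal_length, sensor_dist, aperture):
    """Blur-circle diameter for a point at object_dist under the thin-lens model."""
    # Distance behind the lens at which the point comes to a sharp focus.
    image_dist = 1.0 / (1.0 / focal_length - 1.0 / object_dist)
    return aperture * abs(sensor_dist - image_dist) / image_dist


# Assumed values, all in the same arbitrary length unit.
FOCAL_LENGTH = 1.0
APERTURE = 0.5
SENSOR_A = 1.05   # plane closer to the lens
SENSOR_B = 1.10   # plane further from the lens

blur_pairs = {}
for object_dist in (8.0, 15.0, 30.0):
    blur_a = blur_diameter(object_dist, FOCAL_LENGTH, SENSOR_A, APERTURE)
    blur_b = blur_diameter(object_dist, FOCAL_LENGTH, SENSOR_B, APERTURE)
    blur_pairs[object_dist] = (round(blur_a, 6), round(blur_b, 6))
    print(object_dist, blur_pairs[object_dist])
```

With these numbers, the object at 8 units is sharper on plane B, while the object at 30 units is sharper on plane A. A single blur value can be ambiguous, because a point in front of the focus and one behind it can blur by the same amount, but the pair of values pins down the distance.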

In computer vision, this type of distance calculation is known as depth from defocus. But so far, replicating nature has required large cameras with motorized internal components that capture differently focused images over time. This limits the speed and practical applications of the sensor.

That’s where the metalens comes in.

Federico Capasso, the Robert L. Wallace Professor of Applied Physics and Vinton Hayes Senior Research Fellow in Electrical Engineering at SEAS and co-senior author of the paper, and his lab have already demonstrated metalenses that can simultaneously produce several images containing different information. Building off that research, the team designed a metalens that can simultaneously produce two images with different blur.

The metalens depth sensor working in real time to capture the depth of translucent candle flames. The two images on the left are the raw images captured on the camera sensor. They are formed by the metalens and are blurred slightly differently. From these two images, the researchers compute the depth of the objects in real time. The image on the right shows the computed depth map. (Courtesy of Qi Guo and Zhujun Shi/Harvard University)

“Instead of using layered retinae to capture multiple simultaneous images, as jumping spiders do, the metalens splits the light and forms two differently defocused images side-by-side on a photosensor,” said Shi, who is part of Capasso’s lab.

An ultra-efficient algorithm, developed by Zickler’s group, then interprets the two images and builds a depth map to represent object distance.
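The article does not spell out how the algorithm works, so the following is not the team's method. It is a one-dimensional sketch of a general idea: two differently blurred images can reveal their blur difference through simple per-pixel arithmetic, with no search for matching patches. It assumes Gaussian blur, for which the difference between the two images is approximately half the gap in blur variance times the second derivative of the image. All function names and values are hypothetical.

```python
import math


def gaussian_kernel(sigma):
    """Sampled, normalized Gaussian truncated at four standard deviations."""
    radius = int(math.ceil(4 * sigma))
    weights = [math.exp(-0.5 * (i / sigma) ** 2) for i in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def blur(signal, sigma):
    """Circular convolution of a 1D signal with a Gaussian kernel."""
    kernel = gaussian_kernel(sigma)
    radius = len(kernel) // 2
    n = len(signal)
    return [
        sum(kernel[j + radius] * signal[(i - j) % n] for j in range(-radius, radius + 1))
        for i in range(n)
    ]


def laplacian(signal):
    """Discrete second derivative with wrap-around boundaries."""
    n = len(signal)
    return [signal[(i - 1) % n] - 2 * signal[i] + signal[(i + 1) % n] for i in range(n)]


def blur_variance_gap(image_a, image_b):
    """Least-squares estimate of sigma_b**2 - sigma_a**2 from two blurred images."""
    mean = [(a + b) / 2 for a, b in zip(image_a, image_b)]
    lap = laplacian(mean)
    diff = [b - a for a, b in zip(image_a, image_b)]
    return 2 * sum(d * l for d, l in zip(diff, lap)) / sum(l * l for l in lap)


# Smooth toy scene, blurred by two assumed amounts (sigma = 1 and sigma = 2 pixels).
scene = [math.cos(2 * math.pi * i / 64) + 0.5 * math.cos(2 * math.pi * i / 32)
         for i in range(128)]
gap = blur_variance_gap(blur(scene, 1.0), blur(scene, 2.0))
print(gap)  # should land close to 2.0**2 - 1.0**2 = 3
```

The estimate needs only one pass over the pixels. Turning a blur gap into a distance would still require calibrating the optics, which this sketch leaves out.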

“Being able to design metasurfaces and computational algorithms together is very exciting,” said Qi Guo, a Ph.D. candidate in Zickler’s lab and co-first author of the paper. “This is a new way of creating computational sensors, and it opens the door to many possibilities.”

“Metalenses are a game-changing technology because of their ability to implement existing and new optical functions much more efficiently, faster and with much less bulk and complexity than existing lenses,” said Capasso. “Fusing breakthroughs in optical design and computational imaging has led us to this new depth camera that will open up a broad range of opportunities in science and technology.”