Published on June 6th, 2019 | by Emergent Enterprise


Amazon Go uses Synthetic Data to Train Cashierless Store Algorithms

Emergent Insight:
It’s interesting to learn that, for all the talk about how much data there is in the world, AI designers sometimes don’t have the right data. Khari Johnson reports at VentureBeat that this can create a need for synthetic data: data that is artificially generated rather than collected. That was the case with Amazon Go, whose team created synthetic data to strengthen its machine learning models.

Original Article:
Image Credit: Khari Johnson / VentureBeat

Amazon shared more details today about Amazon Go, the company’s brand for its cashierless stores, including the use of synthetic data to intentionally introduce errors to its computer vision system. Challenges that had to be addressed before opening stores to kill queues include the need to make vision systems that account for sunlight streaming into a store, little tolerance for latency, and small amounts of data for certain tasks.

Synthetic data is being used in a number of ways: to power few-shot learning, improve AI systems that control robots, train AI agents to walk, or beat humans in games of Quake III.

“As our application improved in accuracy — and we have a very highly accurate application today — we had this interesting problem that there were very few negative examples, or errors, which we could use to train our machine learning models,” Dilip Kumar, VP of Amazon Go, said. “So we created synthetic datasets for one of our challenging conditions, which allowed us to be able to boost the diversity of the data that we needed. But at the same time, we have to be careful that we weren’t introducing artifacts that were only visible in the synthetic data sets, [and] that the data translates well to real-world situations — a tricky balance.”

Kumar spoke onstage today at Amazon’s re:Mars conference; the name stands for Machine Learning, Automation, Robotics, and Space.

Also announced today: Alexa Conversations for seamless interactions with Alexa skills using deep learning, new warehouse robots, and plans to begin drone deliveries in the coming months.

Synthetic data is also being used to improve Amazon drones and warehouse robots that made their debut today, Amazon worldwide consumer CEO Jeff Wilke told VentureBeat in a press conference. Echoing Kumar, Wilke said this requires engineers to perform audits to ensure desired outcomes take place.

“Nearly everything that we talked about that [during the keynote address] is using large datasets to train models and create inference gets better if you have simulated data. The problem, of course, is that you can end up with bias in simulated data and blind spots and edge cases and so on,” Wilke said.

Above: Amazon VP Dilip Kumar shares a map of a Go store onstage at re:Mars at the Aria Hotel in Las Vegas on June 5, 2019.

In addition to the use of synthetic data and trials with human customers, Amazon engineers made computer vision algorithms that utilize both geometry and deep learning to ensure Go stores accurately associate transactions with the right customer. To meet the need for crunching data from videos in real time, Amazon also created Kinesis, a service that has since been incorporated into AWS RoboMaker.

To continue reading, go here…


About the Author

The Emergent Enterprise (EE) website brings together current and important news in enterprise mobility and the latest in innovative technologies in the business world. The articles are hand-selected by Emergent Enterprise, not gathered by automated aggregation. The site is designed to be a one-stop shop for anyone with an ongoing interest in how technology is changing the way the world does business and how it affects the workforce, from the shop floor to the top floor. EE encourages visitor contributions and participation through comments, social media activity, and ratings.
