Computer vision is an enabling technology that offers numerous benefits for brands and retailers, including ways to reduce friction, improve the customer experience, automate activities within the store and enhance operations.
In this report, part of our RetailTech series, we provide an overview of the technology and discuss its applications in retail, as well as key technology providers.
Computer vision offers several benefits for retailers and consumers. We believe that the technology can reinvent retail, enabling ubiquitous visual shopping, where a consumer can take a photo of an item on the street, on a digital screen (such as computer, mobile device or television) or in a magazine and then locate and purchase the item online or in a physical store.
The technology is steadily becoming cheaper and more powerful due to advances in computing power and the capabilities of software, fueled by artificial intelligence (AI) including deep learning, which uses software to identify objects. Since computer vision is software-based, potential users of the technology do not have to make heavy investments in computer hardware—apart from cameras, which are commodities. Computer-vision technology promises to enable a new generation of retail business models that are visually based.
Applications in Retail
Computer vision technology offers numerous functions and benefits to retail, including enabling payments, visual search and personalized product recommendations, as well as tracking inventory, enabling unstaffed stores and more. We summarize the applications of computer vision in retail in Figure 1 and explore selected examples in further detail below.
Figure 1. Applications of Computer Vision in Retail
[caption id="attachment_125266" align="aligncenter" width="550"] Source: Coresight Research[/caption]Visual Search and Personalized Product Recommendations
We believe that the power of visual search offers opportunities for retailers to spark a new level of consumer enthusiasm for shopping. Visual search enables consumers to search for an identical (or highly similar) item on a retailer’s website or app, based on a photo of an item captured in real life or from a fashion magazine, for example. The item can then be purchased in a physical store or online.
We envision several areas for the deployment of visual search technology:
Once an item is captured visually, software can match the attributes—such as brand, color, pattern or style—to tags that can be combined with consumer preferences to make future recommendations of similar or complementary items. The image below shows a demonstration of visual search from innovator Syte.ai.
[caption id="attachment_125267" align="aligncenter" width="200"] Visual searchRecent developments in visual search include the following:
Apparel Sizing
Improper fit of clothing is a leading motivation for consumers to make returns, which are expensive for retailers. Several innovators are using computer vision to tackle this issue. Solutions typically leverage smartphone cameras to take body measurements, which are then correlated with collected data in order to determine the correct size, taking into consideration the brand’s sizing and consumer preferences for a tight or loose fit.
For example, innovator 3DLOOK’s Mobile Tailor solution uses two consumer-generated smartphone photos to generate a digital 3D rendering of a body that incorporates up to 70 measurements.
Recent developments in apparel sizing include the following:
Facial Recognition
Although facial-recognition technology has become commonplace—such as in smartphone security processes or residential security systems—it offers promise for new applications in retail, including the following:
Like many new technologies, particularly those using AI, the use of facial-recognition technology generates several moral and ethical questions, particularly around privacy. While identifying consumers’ faces offers numerous opportunities for enhancing customer service, it does involve the loss of privacy and anonymity and can easily be misused.
Despite the power of the technology, privacy issues will likely prevent the use of facial recognition for identity verification in many countries. New York City passed an ordinance in March 2021 that requires businesses that collect, store, or share biometric information from customers to post a sign near all customer entrances that discloses the practice.
Walmart had tested the technology to identify shoplifters but dropped the test due to privacy concerns. The company had also experimented with computer-vision software for detecting customers’ emotions, but this was likely not deployed for the same reason. There is quite a bit of friction in the signing-in process in traditional and unstaffed stores that could be removed with facial recognition and it could possibly be implemented if consumers opt-in to agree to its use.
Recent developments in facial recognition include the following:
Inventory Tracking
Computer-vision systems for tracking inventory in real time in physical stores come in both fixed and mobile configurations.
Bossa Nova Robotics offers fixed cameras and drone-mounted cameras, which can be employed in combination. The technology company had supplied shelf-scanning robots to Walmart for five years, but the retailer ended the relationship in 2020.
[caption id="attachment_125270" align="aligncenter" width="500"] SmartSight robotic solutionRecent developments in inventory tracking include the following:
Unstaffed Stores
Unstaffed stores use computer vision for two different purposes: (1) to identify which items the shopper has picked up and plans to take out of the store; and (2) to maintain a real-time account of store inventory. Such stores employ “sensor fusion,” a combination of video cameras and weight sensors.
Companies developing cashierless-store technology include AiFi, Grabango, Standard Cognition, Trigo and Zippin.
Amazon has enhanced its technology to support larger stores than the initial convenience-store format, opening its first Amazon Go Grocery in February 2020. The company has offered its “Just Walk Out” technology to other retailers and has signed up its first customer: the Hudson Nonstop store at Dallas Love Field Airport.
[caption id="attachment_125271" align="aligncenter" width="500"] Ceiling in Amazon Go storeTo read related content, see our previous report on the accelerated development of the unstaffed retail business in China due to Covid-19, and our Retail Reimagined report on contact-light retail.
Recent developments in unstaffed stores include the following:
Collecting In-Store Data
Computer-vision systems can be used to collect a broader variety of data inside a physical store—in addition to items purchased—such as on consumer shopping behavior, which retailers can use to mitigate losses and generate other valuable insights. Computer-vision systems are able to collect several types of data for analysis:
Other Applications in the Store, Mall or Warehouse
Computer vision is a fundamental technology in autonomous robotics, including shelf-scanning robots as mentioned above. Robots can also perform many of the menial, repetitive tasks around a store or mall.
The Computer-Vision Market and Key Technology Providers
The computer-vision market in the US (which is roughly valued in the low-single-billion dollars) is dominated by industrial applications. We believe that retail represents less than 5% of this market, since retail applications are in their infancy. Retailers and other users make use of computer-vision applications, which use AI tools and run on AI platforms. These platforms can achieve high performance through the use of specialized AI chips, and computer-vision systems need cameras for obtaining images. The retail computer-vision value chain is illustrated in Figure 2.
Large Companies
US companies offering computer-vision software include Alphabet, Ambarella, CEVA, Cognex, Facebook, IBM, Intel, Leidos, National Instruments and Novanta. Global suppliers include Gemalto, NEC and Panasonic.
Major cloud software and service vendors such as Amazon AWS, Google Cloud and Microsoft Azure also offer APIs (application program interfaces) for image recognition.
Users of facial-recognition hardware and software include Apple and Alphabet (through its Android smartphone operating system).
China is making a push toward leadership in AI in general, and facial-recognition technology is more commonplace there. Technology companies possessing advanced computer-vision solutions include Alibaba, JD.com, Megvii and Tencent.
Innovators
We present selected innovators in computer vision in Figure 3.
Computer Vision as Part of the AI Tech Family
Computer vision aims to extract information from digital image and makes use of knowledge from the disciplines of biology, psychology, mathematics, physics and computer science.
Computer vision leverages the following general functions:
The technology is just one application of AI that offers benefits to retail (see Figure 4).
Neural networks are one technology for implementing image recognition using deep learning. The networks consist of a network of nodes, in which each seeks to mimic the function of cells in the human brain. Each node performs a mathematical function on its inputs, which is transmitted to its output. The network can be “taught” by observing the output value and adjusting the parameters on the nodes until the network provides the desired output for a given input. The term “deep” implies multiple hidden layers residing between the input layer and output layer. The output value needs to exceed some threshold value to represent an accurate recognition.
Retailers need to consider implementing visual search and other applications of computer vision in order to keep the interest of visually oriented consumers in a crowded landscape and also compete with other visual platforms such as livestreaming and shoppable video platforms.
Implications for Brands/Retailers
Implications for Real Estate Firms
Implications for Technology Vendors