Create your first automation in just a few minutes.Try Studio Web

Google Vision

Google Vision

by UiPath







Google Cloud Vision API enables developers to understand the content of an image by encapsulating powerful machine learning models in an easy-to-use REST API.



Introduction to Google Cloud Vision API

The Google Cloud Vision API empowers developers to comprehend the content of images through the encapsulation of robust machine learning models within a user-friendly REST API.

It swiftly categorizes images into an extensive array of classifications, e.g., "sailboat," "lion," or "Eiffel Tower." Additionally, it adeptly identifies individual objects and faces within images, and proficiently locates and interprets printed words embedded within the visual content.

Initial Configuration for Using Google Vision Activities Package

Before using the the Google Vision Activities Package, you need to configure your applications using the Google Cloud Platform .

There are two configuration steps:

  1. Enable APIs - this step enables automation by granting API access to your Google applications.
  2. Create Credentials - this specifies the authentication type used to interact with your enabled APIs.

Activities in the Google Vision Activities Package

The activities included in this package are:

  • Google Vision Scope - Handles the authentication to the Google Vision service.
  • Annotate Image - Requests one or more annotation types per image.
  • Face Detection - Detects multiple faces within an image, along with the associated key facial attributes like emotional state or wearing head wear. Facial Recognition is not supported.
  • Handwriting Detection - Identifies and reads handwritten text inside the image.
  • Label Detection - Detects broad sets of categories within an image, ranging from modes of transportation to animals.
  • Logo Detection - Detects popular product logos within an image.
  • Safe Search - Detects explicit content like adult content or violent content within an image.
  • Text Detection - Detects and extracts text within an image, with support for a broad range of languages, along with support for automatic language identification.
  • Web Detection - Searches the internet for similar images.



Diverse Applications of Google Vision Activities

The Google Vision package facilitates comprehensive metadata analysis of images, offering diverse applications across various industries.

Potential Use Cases of Google Vision Activities

Potential use cases include:

  1. Label Detection: Identifying expansive categories within an image, spanning from modes of transportation to various animals.
  2. Explicit Content Detection: Discerning explicit content, such as adult themes or violent elements, embedded within an image.
  3. Logo Detection: Recognizing popular product logos present in an image.
  4. Optical Character Recognition (OCR): Detecting and extracting text within an image, with versatile language support and automatic language identification.
  5. Face Detection: Capably identifying multiple faces within an image.
  6. Web Detection: Harnessing the power to explore the internet for images similar to the one under analysis.

Additional Information

Additional Information


Newtonsoft.Json >= 12.0.3 Google.Apis.Vision >=

Similar Listings