G06V10/426

SCENE GRAPH EMBEDDINGS USING RELATIVE SIMILARITY SUPERVISION
20220391433 · 2022-12-08 ·

Systems and methods for image processing are described. One or more embodiments of the present disclosure identify an image including a plurality of objects, generate a scene graph of the image including a node representing an object and an edge representing a relationship between two of the objects, generate a node vector for the node, wherein the node vector represents semantic information of the object, generate an edge vector for the edge, wherein the edge vector represents semantic information of the relationship, generate a scene graph embedding based on the node vector and the edge vector using a graph convolutional network (GCN), and assign metadata to the image based on the scene graph embedding.

BOARD DAMAGE CLASSIFICATION SYSTEM

A board damage classification system includes a Convolutional Neural Network (CNN) sub-engine and a Graph Convolutional Network (GCN) sub-engine that were trained based on digital images of structures that have experienced natural disasters. The CNN sub-engine receives a board digital image of a board, analyzes the board digital image to identify board features, and determines a board feature damage classification for the board features. The CGN sub-engine receives a board feature graph that was generated using the board digital image and that includes nodes that correspond to the board features in the board digital image, and defines relationships between the nodes included in the board feature graph. The board feature damage classification determined by the CNN sub-engine and the relationships defined by the GCN sub-engine are then used to generate a board damage classification that includes a damage probability for board features in the board digital image.

Artificial intelligence intra-operative surgical guidance system and method of use

The inventive subject matter is directed to an artificial intelligence intra-operative surgical guidance system and method of use. The artificial intelligence intra-operative surgical guidance system is made of a computer executing one or more automated artificial intelligence models trained on data layer datasets collections to calculate surgical decision risks, and provide intra-operative surgical guidance; and a display configured to provide visual guidance to a user.

Artificial intelligence intra-operative surgical guidance system and method of use

The inventive subject matter is directed to an artificial intelligence intra-operative surgical guidance system and method of use. The artificial intelligence intra-operative surgical guidance system is made of a computer executing one or more automated artificial intelligence models trained on data layer datasets collections to calculate surgical decision risks, and provide intra-operative surgical guidance; and a display configured to provide visual guidance to a user.

Unified framework for multi-modal similarity search

Technology is disclosed herein for enhanced similarity search. In an implementation, a search environment includes one or more computing hardware, software, and/or firmware components in support of enhanced similarity search. The one or more components identify a modality for a similarity search with respect to a query object. The components generate an embedding for the query object based on the modality and based on connections between the query object and neighboring nodes in a graph. The embedding for the query object provides the basis for the search for similar objects.

Deep learning based identification of difficult to test nodes

Techniques to improve the accuracy and speed for detection and remediation of difficult to test nodes in a circuit design netlist. The techniques utilize improved netlist representations, test point insertion, and trained neural networks.

Compatibility based furniture recommendations

Examples disclosed herein are relevant to systems, methods, and other technology for determining furniture compatibility. For example, graph neural networks (GNNs) that leverage relational information between furniture items in a set may be used as models to predict a compatibility score indicative of visual compatibility of furniture items across the set. In one implementation, the GNN-based model can extend the concept of a siamese network to multiple inputs and branches and use a generalized contrastive loss function. In another implementation, the GNN-based model learns both an edge function and the function that generates the compatibility score. The predicted compatibility score can be used for a variety of purposes, including furniture item recommendations.

Method and System for Disease Quantification of Anatomical Structures

This disclosure discloses a method and system for predicting disease quantification parameters for an anatomical structure. The method includes extracting a centerline structure based on a medical image. The method further includes predicting the disease quantification parameter for each sampling point on the extracted centerline structure by using a GNN, with each node corresponds to a sampling point on the extracted centerline structure and each edge corresponds to a spatial constraint relationship between the sampling points. For each node, a local feature is extracted based on the image patch for the corresponding sampling point by using a local feature encoder, and a global feature is extracted by using a global feature encoder based on a set of image patches for a set of sampling points, which include the corresponding sampling point and have a spatial constraint relationship defined by the centerline structure. Then, an embed feature is obtained based on both the local feature and the global feature and input into to the node. The method is able to integrate local and global consideration factors of the sampling points into the GNN to improve the prediction accuracy.

SYSTEMS AND METHODS FOR RETRIEVING VIDEOS USING NATURAL LANGUAGE DESCRIPTION
20230086735 · 2023-03-23 · ·

Implementations are directed to methods, systems, and computer-readable media for obtaining videos and extracting, from each video, a key frame for the video including a timestamp. For each key frame, a scene graph is generated. Generating the scene graph for the key frame includes identifying, objects in the image, and extracting a relationship feature defining a relationship between a first object and a second, different object of the objects in the key frame. The scene graph for the key frame is generated that includes a set of nodes and a set of edges. A natural language query request for a video is received, including terms defining a relationship between two or more particular objects. A query graph is generated for the natural language query request, and a set of videos corresponding to the set of scene graphs matching the query graph are provided for display on a user device.

SYSTEMS AND METHODS FOR RETRIEVING VIDEOS USING NATURAL LANGUAGE DESCRIPTION
20230086735 · 2023-03-23 · ·

Implementations are directed to methods, systems, and computer-readable media for obtaining videos and extracting, from each video, a key frame for the video including a timestamp. For each key frame, a scene graph is generated. Generating the scene graph for the key frame includes identifying, objects in the image, and extracting a relationship feature defining a relationship between a first object and a second, different object of the objects in the key frame. The scene graph for the key frame is generated that includes a set of nodes and a set of edges. A natural language query request for a video is received, including terms defining a relationship between two or more particular objects. A query graph is generated for the natural language query request, and a set of videos corresponding to the set of scene graphs matching the query graph are provided for display on a user device.