Patent classifications
G06F16/58
Method and system for facilitating keyword-based searching in images
Technologies are generally described for a system to extract description of reference numerals in images and facilitate keyword-based search in images. In various examples, the system may include one or more databases, a computer readable memory, and one or more processors. The system may be configured to extract one or more reference numerals from an image, and identify and extract corresponding description of the one or more reference numerals from a description document corresponding to the image. The system may be further configured to extract text from the images, and store the images in a database with the extracted data, i.e., text, reference numerals, and corresponding descriptions. The system may be further configured to receive an input query intending to search images related to a search logic of the input query, search a database to identify an image including either of text or a reference numeral having a corresponding description that corresponds to the search logic of the input query, and render the identified image via a display device executing the output interface.
Online, incremental real-time learning for tagging and labeling data streams for deep neural networks and neural network applications
Today, artificial neural networks are trained on large sets of manually tagged images. Generally, for better training, the training data should be as large as possible. Unfortunately, manually tagging images is time consuming and susceptible to error, making it difficult to produce the large sets of tagged data used to train artificial neural networks. To address this problem, the inventors have developed a smart tagging utility that uses a feature extraction unit and a fast-learning classifier to learn tags and tag images automatically, reducing the time to tag large sets of data. The feature extraction unit and fast-learning classifiers can be implemented as artificial neural networks that associate a label with features extracted from an image and tag similar features from the image or other images with the same label. Moreover, the smart tagging system can learn from user adjustment to its proposed tagging. This reduces tagging time and errors.
Automatic speaker identification in calls using multiple speaker-identification parameters
A speaker identification system (“system”) automatically assigns a speaker to voiced segments in a call. The system identifies one or more speakers in a call using one or more speaker-identification parameters. The system processes the call to determine one or more speaker-identification parameters, such as a transcript of the call, a facial image of the speaker, a scene image, which is an image of a scene in which the speaker is located during the call, or textual data associated with the call such as names of the speaker or an organization that are retrieved from the scene images or video data of the call. The system analyzes one or more of the speaker-identification parameters and determines the identity of the speaker. The system then identifies the voice segments associated with the identified speaker and marks the voice segments with the identity of the speaker.
Automatic speaker identification in calls using multiple speaker-identification parameters
A speaker identification system (“system”) automatically assigns a speaker to voiced segments in a call. The system identifies one or more speakers in a call using one or more speaker-identification parameters. The system processes the call to determine one or more speaker-identification parameters, such as a transcript of the call, a facial image of the speaker, a scene image, which is an image of a scene in which the speaker is located during the call, or textual data associated with the call such as names of the speaker or an organization that are retrieved from the scene images or video data of the call. The system analyzes one or more of the speaker-identification parameters and determines the identity of the speaker. The system then identifies the voice segments associated with the identified speaker and marks the voice segments with the identity of the speaker.
Drawing management apparatus and system
A drawing management apparatus includes a receiver, a searcher, and a presenter. The receiver receives input of information concerning a specific subject and information concerning a purpose of a design change to be made to the specific subject. The searcher searches for a pair of drawings which have characteristics similar to characteristics of the specific subject and which are constituted by a drawing to which a design change has been made in accordance with the purpose and a drawing to which the design change has not yet been made, on the basis of the information concerning the specific subject and the information concerning the purpose of the design change. The presenter presents the searched pair of drawings to a user.
Systems and methods for coordinated collection of street-level image data
The disclosed computer-implemented method may include (i) identifying, by a server computer system, a provider computing device for use in capturing street-level image data, where the provider computing device controls a camera positioned to capture street-level imagery outside the vehicle, (ii) determining, by the server computer system, a configuration that controls use of the provider computing device to provide street-level image data captured by the camera to the server computer system, (iii) sending, by the server computer system, the configuration to the computing device, and (iv) receiving, from the computing device, street-level image data captured by the computing device using the camera responsive to the configuration. Various other methods, systems, and computer-readable media are also disclosed.
METHOD AND APPARATUS FOR DETERMINING ITEM NAME, COMPUTER DEVICE, AND STORAGE MEDIUM
A method includes: obtaining a first image including a target item; selecting a plurality of reference images corresponding to the first image from a database; performing word segmentation on item text information corresponding to the plurality of reference images to obtain a plurality of words; and extracting a key word meeting a reference condition from the plurality of words, and determining the extracted key word as an item name of the target item.
Interactive geographical map
The systems, methods, and devices of the present disclosure may provide, among other features, high-performance, interactive geographical and/or data object map capabilities in which large amounts of geographical, geospatial, and other types of data, geodata, objects, features, and/or metadata are efficiently presented to a user on a map interface. In various embodiments, an interactive geographical map display system may enable rapid and deep analysis of various objects, features, and/or metadata by the user by aggregating and clustering large sets of data into aggregate values and clusters. The user can select various clusters, via the user interface, to interact with the data, clusters, and map.
Cloud-based large-scale pathological image collaborative annotation method and system
An image preprocessing module is provided by the present invention to pre-process pathological images through superpixel segmentation in the cloud infrastructure, and meanwhile stores the boundary thereof by using the binary image; and performing multi-user data recording by the data recording module, so that the user can collaborate on image annotation; through the front-end interaction module, the capturing action of the user and displaying the pathological image; through the back-end processing module, the responding to the request sent by the front-end interaction module and completing the annotation of the data and the suggesting possible annotation sub-regions to the users. Through the effects achieved by the above modules, the present invention accomplishes the purpose of multi-user collaborative and convenient pixel-level annotation on pathological images via a webpage.
Cloud-based large-scale pathological image collaborative annotation method and system
An image preprocessing module is provided by the present invention to pre-process pathological images through superpixel segmentation in the cloud infrastructure, and meanwhile stores the boundary thereof by using the binary image; and performing multi-user data recording by the data recording module, so that the user can collaborate on image annotation; through the front-end interaction module, the capturing action of the user and displaying the pathological image; through the back-end processing module, the responding to the request sent by the front-end interaction module and completing the annotation of the data and the suggesting possible annotation sub-regions to the users. Through the effects achieved by the above modules, the present invention accomplishes the purpose of multi-user collaborative and convenient pixel-level annotation on pathological images via a webpage.