Ab Initio Technology LLC

United States of America

Back to Profile

1-100 of 780 for Ab Initio Technology LLC Sort by
Query
Aggregations
Jurisdiction
        United States 346
        Canada 224
        World 210
Date
New (last 4 weeks) 14
2025 July (MTD) 2
2025 June 12
2025 March 1
2025 (YTD) 20
See more
IPC Class
G06F 17/30 - Information retrieval; Database structures therefor 133
G06F 9/44 - Arrangements for executing specific programs 90
G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU] 72
G06F 8/34 - Graphical or visual programming 71
G06F 16/901 - IndexingData structures thereforStorage structures 55
See more
Status
Pending 84
Registered / In Force 696
Found results for  patents
  1     2     3     ...     8        Next Page

1.

TECHNIQUES FOR ASSIGNING LABELS TO DATASET FIELDS

      
Application Number 18999645
Status Pending
Filing Date 2024-12-23
First Publication Date 2025-07-03
Owner Ab Initio Technology LLC (USA)
Inventor
  • Melbouci, Sandrick
  • Schon, Andrew

Abstract

Techniques for processing a dataset comprising data stored in fields to identify field labels. The field labels describe data stored in the dataset fields. The techniques determine whether any field labels in a field label glossary match a field. If none of the field labels in the field label glossary match the field, the techniques generate a new field label using the name of the field. The generated field label may be assigned to the field.

IPC Classes  ?

  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models

2.

TECHNIQUES FOR ASSIGNING LABELS TO DATASET FIELDS

      
Application Number US2024061691
Publication Number 2025/144787
Status In Force
Filing Date 2024-12-23
Publication Date 2025-07-03
Owner
  • AB INITIO TECHNOLOGY LLC (USA)
  • AB INITIO SOFTWARE LLC (USA)
  • AB INITIO ORIGINAL WORKS LLC (USA)
Inventor
  • Melbouci, Sandrick
  • Schon, Andrew

Abstract

Techniques for processing a dataset comprising data stored in fields to identify field labels. The field labels describe data stored in the dataset fields. The techniques determine whether any field labels in a field label glossary match a field. If none of the field labels in the field label glossary match the field, the techniques generate a new field label using the name of the field. The generated field label may be assigned to the field.

IPC Classes  ?

  • G06F 16/81 - Indexing, e.g. XML tagsData structures thereforStorage structures
  • G06F 16/901 - IndexingData structures thereforStorage structures

3.

DEVELOPMENT ENVIRONMENT FOR AUTOMATICALLY GENERATING CODE USING A MULTI-TIERED METADATA MODEL

      
Application Number 18987691
Status Pending
Filing Date 2024-12-19
First Publication Date 2025-06-26
Owner Ab Initio Technology LLC (USA)
Inventor
  • Radivojevic, Dusan
  • Egenolf, Jonah

Abstract

A method for using a development environment to automatically generate code from a multi-tiered metadata model includes: receiving a specification to process a dataset, and, in response, accessing dataset characteristics and identifying controls received from a development environment to be applied to a field of the dataset in accordance with a metadata model by: accessing a first instance of a data structure that corresponds to the dataset; based on a reference in the first instance, accessing a second instance of a data structure associated with the field; based on a reference in the second instance, accessing a third instance of a data structure associated with metadata describing the field, and based on a reference in the third instance, accessing a fourth instance of a data structure storing a control defined based on the metadata. Based on the dataset characteristics, code is generated to apply the identified control to the field.

IPC Classes  ?

4.

LIVE INTERVAL ANALYSIS

      
Application Number US2024034454
Publication Number 2025/136448
Status In Force
Filing Date 2024-06-18
Publication Date 2025-06-26
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor Bason, Iain

Abstract

An approach to allocation of referenced objects to memory resources addresses a situation in which there are a far greater number of memory resources, for example, 216 elements in the set of memory resources, and yet the objects referenced in a program specification exceeds this number. The approach is applicable to compilation of a program specification for execution on a physical or virtual processor.

IPC Classes  ?

5.

A DEVELOPMENT ENVIRONMENT FOR AUTOMATICALLY GENERATING CODE USING A MULTI-TIERED METADATA MODEL

      
Application Number US2024061392
Publication Number 2025/137522
Status In Force
Filing Date 2024-12-20
Publication Date 2025-06-26
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Radivojevic, Dusan
  • Egenolf, Jonah

Abstract

A method for using a development environment to automatically generate code from a multi-tiered metadata model includes: receiving a specification to process a dataset, and, in response, accessing dataset characteristics and identifying controls received from a development environment to be applied to a field of the dataset in accordance with a metadata model by: accessing a first instance of a data structure that corresponds to the dataset; based on a reference in the first instance, accessing a second instance of a data structure associated with the field; based on a reference in the second instance, accessing a third instance of a data structure associated with metadata describing the field, and based on a reference in the third instance, accessing a fourth instance of a data structure storing a control defined based on the metadata. Based on the dataset characteristics, code is generated to apply the identified control to the field.

IPC Classes  ?

  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
  • G06F 16/23 - Updating
  • G06F 16/242 - Query formulation
  • G06F 16/25 - Integrating or interfacing systems involving database management systems
  • G06F 16/26 - Visual data miningBrowsing structured data
  • G06F 40/30 - Semantic analysis
  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models

6.

Live Interval Analysis

      
Application Number 18746432
Status Pending
Filing Date 2024-06-18
First Publication Date 2025-06-26
Owner Ab Initio Technology LLC (USA)
Inventor Bason, Iain

Abstract

An approach to allocation of referenced objects to memory resources addresses a situation in which there are a far greater number of memory resources, for example, 216 elements in the set of memory resources, and yet the objects referenced in a program specification exceeds this number. The approach is applicable to compilation of a program specification for execution on a physical or virtual processor.

IPC Classes  ?

  • G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]

7.

PARTITION-BASED ESCROW IN A DISTRIBUTED COMPUTING SYSTEM

      
Application Number US2024060078
Publication Number 2025/129027
Status In Force
Filing Date 2024-12-13
Publication Date 2025-06-19
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor Babroudi, Zori

Abstract

A method for fault-tolerant processing of a number of data elements using a distributed computing cluster. The distributed computing cluster includes a number of data processors associated with a corresponding number of data stores. The method includes storing the data elements in the distributed computing cluster, wherein the data elements are distributed across the data stores according to a number of partitions of data elements, processing data elements of a first set of partitions stored at a first data store using a first data processor to generate first result data for the data elements of the first set of partitions, sending the first result data from the distributed computing cluster to a processing component of the first result data outside the distributed computing cluster, and storing the first result data in a first buffer located in the distributed computing cluster and associated with the first data processor until the processing component has persistently stored the first result data outside the distributed computing cluster.

IPC Classes  ?

  • G06F 11/14 - Error detection or correction of the data by redundancy in operation, e.g. by using different operation sequences leading to the same result

8.

Partition-based Escrow in a Distributed Computing System

      
Application Number 18980527
Status Pending
Filing Date 2024-12-13
First Publication Date 2025-06-19
Owner Ab Initio Technology LLC (USA)
Inventor Babroudi, Zori

Abstract

A method for fault-tolerant processing of a number of data elements using a distributed computing cluster. The distributed computing cluster includes a number of data processors associated with a corresponding number of data stores. The method includes storing the data elements in the distributed computing cluster, wherein the data elements are distributed across the data stores according to a number of partitions of data elements, processing data elements of a first set of partitions stored at a first data store using a first data processor to generate first result data for the data elements of the first set of partitions, sending the first result data from the distributed computing cluster to a consumer of the first result data outside the distributed computing cluster, and storing the first result data in a first buffer located in the distributed computing cluster and associated with the first data processor until the consumer has persistently stored the first result data outside the distributed computing cluster.

IPC Classes  ?

  • G06F 11/16 - Error detection or correction of the data by redundancy in hardware
  • G06F 16/23 - Updating

9.

DATASET MULTIPLEXER WITH DATASET RESOLVER FOR DATA PROCESSING SYSTEM

      
Application Number 18961714
Status Pending
Filing Date 2024-11-27
First Publication Date 2025-06-05
Owner Ab Initio Technology LLC (USA)
Inventor
  • Weisman, Amit
  • Fantasia, Cory Christopher James
  • Parks, Robert
  • Eads, Matthew Austin
  • Zimmerman, William Robert
  • Huang, Aaron
  • Becker, Matthew Douglas

Abstract

A data processing system with a dataset multiplexer that enables applications to be written to specify access to datasets as operations on logical datasets. During execution of an application by the data processing system, the physical dataset used for performing data access operations may be selected based on current context. Current context may be specified based on values of system parameters and/or user specified values. The physical dataset accessed may be identified by selecting a record from multiple records in a dataset catalog associated with the logical dataset. Each record includes information to access a physical dataset associated with the selected record and context information to indicate the context in which the specific physical dataset is to be selected.

IPC Classes  ?

  • G06F 16/908 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
  • G06F 16/901 - IndexingData structures thereforStorage structures

10.

TECHNIQUES FOR RESOLVING DATA FIELDS AVAILABLE AT POINTS IN A SOFTWARE APPLICATION

      
Application Number 18964167
Status Pending
Filing Date 2024-11-29
First Publication Date 2025-06-05
Owner Ab Initio Technology LLC (USA)
Inventor
  • Dickie, Garth Allen
  • Sciarappa, Luke Bryant
  • Miller, Zachary James
  • Schechter, Ian Robert

Abstract

Some embodiments relate to generating a list of data fields referenceable at a point in a graph (there are different lists for each point). This list may be used as part of programming a dataflow graph to select data (e.g., at an input node of a component to select data processed in that component). One aspect relates to display of the list of data fields, because some of the data field names may be overloaded. Accordingly, the data fields may be presented hierarchically if necessary, showing the source for each overloaded data field name. Otherwise, the user may select whether the list of referenceable fields is grouped by source.

IPC Classes  ?

11.

TECHNIQUES FOR RESOLVING DATA FIELDS AVAILABLE AT POINTS IN A SOFTWARE APPLICATION

      
Application Number US2024057981
Publication Number 2025/117907
Status In Force
Filing Date 2024-11-29
Publication Date 2025-06-05
Owner
  • AB INITIO TECHNOLOGY LLC (USA)
  • AB INITIO SOFTWARE LLC (USA)
  • AB INITIO ORIGINAL WORKS LLC (USA)
Inventor
  • Dickie, Garth, Allen
  • Sciarappa, Luke, Bryant
  • Miller, Zachary, James
  • Schechter, Ian, Robert

Abstract

Some embodiments relate to generating a list of data fields referenceable at a point in a graph (there are different lists for each point). This list may be used as part of programming a dataflow graph to select data (e.g., at an input node of a component to select data processed in that component). One aspect relates to display of the list of data fields, because some of the data field names may be overloaded. Accordingly, the data fields may be presented hierarchically if necessary, showing the source for each overloaded data field name. Otherwise, the user may select whether the list of referenceable fields is grouped by source.

IPC Classes  ?

12.

AUTOMATED IDENTIFICATION AND MIGRATION OF INPUT AND BASELINE TEST DATA

      
Application Number US2024058077
Publication Number 2025/117946
Status In Force
Filing Date 2024-12-02
Publication Date 2025-06-05
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Hawkins, Miranda
  • Isman, Marshall A.
  • Jankovic, Maja
  • Samoylovich, Alexander
  • Hurd, Ben

Abstract

A computer-implemented method for defining a test for a computer program includes receiving operational data generated during execution of a computer program in a first computing environment, the operational data indicative of (i) a data source accessed by the computer program during execution of the computer program and (ii) a destination to where baseline data records are output by the computer program during execution of the computer program. Based on the received operational data, a data storage object is generated that includes (i) input data records from the data source and the baseline data records from the destination, and (ii) test definition data for the first computing environment. Responsive to migration of the computer program to a second computing environment, the input and baseline data records from the data storage object are stored in the second computing environment. A test configuration is defined for the migrated computer program in the second computing environment according to the test definition data in the data storage object and the mapping between the first computing environment and the second computing environment, the test configuration for the migrated computer program identifying a location of the input data records and a location of the baseline data records in the second computing environment. Execution of the migrated computer program in the second computing environment is tested using the input data records and baseline data records in the second computing environment and according to the defined test configuration for the migrated computer program.

IPC Classes  ?

13.

AUTOMATED IDENTIFICATION AND MIGRATION OF INPUT AND BASELINE TEST DATA

      
Application Number 18962909
Status Pending
Filing Date 2024-11-27
First Publication Date 2025-06-05
Owner Ab Initio Technology LLC (USA)
Inventor
  • Hawkins, Miranda
  • Isman, Marshall A.
  • Jankovic, Maja
  • Samoylovich, Alexander
  • Hurd, Ben

Abstract

A computer-implemented method for defining a test for a computer program includes receiving operational data generated during execution of a computer program in a first computing environment, the operational data indicative of (i) a data source accessed by the computer program during execution of the computer program and (ii) a destination to where baseline data records are output by the computer program during execution of the computer program. Based on the received operational data, a data storage object is generated that includes (i) input data records from the data source and the baseline data records from the destination, and (ii) test definition data for the first computing environment. Responsive to migration of the computer program to a second computing environment, the input and baseline data records from the data storage object are stored in the second computing environment. A test configuration is defined for the migrated computer program in the second computing environment according to the test definition data in the data storage object and the mapping between the first computing environment and the second computing environment, the test configuration for the migrated computer program identifying a location of the input data records and a location of the baseline data records in the second computing environment. Execution of the migrated computer program in the second computing environment is tested using the input data records and baseline data records in the second computing environment and according to the defined test configuration for the migrated computer program.

IPC Classes  ?

  • G06F 16/36 - Creation of semantic tools, e.g. ontology or thesauri

14.

DATASET MULTIPLEXER WITH DATASET RESOLVER FOR DATA PROCESSING SYSTEM

      
Application Number US2024057602
Publication Number 2025/117635
Status In Force
Filing Date 2024-11-27
Publication Date 2025-06-05
Owner
  • AB INITIO TECHNOLOGY LLC (USA)
  • AB INITIO ORIGINAL WORKS LLC (USA)
  • AB INITIO SOFTWARE LLC (USA)
Inventor
  • Weisman, Amit
  • Fantasia, Cory, Christopher James
  • Parks, Robert
  • Eads, Matthew, Austin
  • Zimmerman, William, Robert
  • Huang, Aaron
  • Becker, Matthew, Douglas

Abstract

A data processing system with a dataset multiplexer that enables applications to be written to specify access to datasets as operations on logical datasets. During execution of an application by the data processing system, the physical dataset used for performing data access operations may be selected based on current context. Current context may be specified based on values of system parameters and/or user specified values. The physical dataset accessed may be identified by selecting a record from multiple records in a dataset catalog associated with the logical dataset. Each record includes information to access a physical dataset associated with the selected record and context information to indicate the context in which the specific physical dataset is to be selected.

IPC Classes  ?

  • G06F 16/25 - Integrating or interfacing systems involving database management systems

15.

CONVERSION OF DATA LINEAGES

      
Application Number US2024043393
Publication Number 2025/049234
Status In Force
Filing Date 2024-08-22
Publication Date 2025-03-06
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor Wagner, Philippe

Abstract

The present disclosure relates to a computer-implemented method for conversion of a first data lineage to a second data lineage, the method comprising: obtaining a first data lineage specifying relationships among physical components of a plurality of physical components; receiving an identification of a portion of the first data lineage; generating a second data lineage from the identified portion of the first data lineage, the second data lineage specifying relationships among second components of a plurality of second components, wherein the second components of the plurality of second components are associated with at least some of the physical components of the identified portion of the first data lineage. A corresponding computer-readable medium, a corresponding a data processing system, and a corresponding computer program are also described.

IPC Classes  ?

16.

DATA SET EVALUATION BASED ON DATA LINEAGE ANALYSIS

      
Application Number US2024042084
Publication Number 2025/038623
Status In Force
Filing Date 2024-08-13
Publication Date 2025-02-20
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor De Jager, Dirk

Abstract

The present disclosure relates to a computer-implemented method, the method comprising: obtaining a data lineage whose structure specifies relationships among data sets of a plurality of data sets; analyzing the structure of the data lineage; based on a result of the analyzing, identifying a subset of the plurality of data sets for which a parameter is to be evaluated, wherein the subset includes one or more of the data sets, and wherein the parameter is for indicating a potential error within a data set; and outputting an indication of the identified subset of one or more data sets. A computer-readable medium, computer program, a corresponding data processing apparatus, and a data structure are described as well.

IPC Classes  ?

  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors

17.

MACHINE LEARNING TECHNIQUES FOR DISCOVERING KEYS IN RELATIONAL DATASETS

      
Application Number US2024039543
Publication Number 2025/029579
Status In Force
Filing Date 2024-07-25
Publication Date 2025-02-06
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Chang, Andrew
  • Oberdorf, Richard
  • Wang, Song
  • Smith, Nathan

Abstract

Techniques for discovering primary, unique, and/or foreign keys for relational datasets are described. The techniques include profiling the relational datasets to obtain respective data profiles; identifying one or more primary key candidates for a first relational dataset using a first data profile of the first relational dataset and a first trained machine learning model; identifying one or more foreign key proposals for a second relational dataset using the one or more primary key candidates by performing a subset analysis of the second relational dataset with respect to the first relational dataset; identifying one or more foreign key candidates for the second relational dataset using the first data profile, a second data profile of the second relational dataset, and a second trained machine learning model different from the first trained machine learning model; and outputting the at primary key candidate(s) and the foreign key candidate(s).

IPC Classes  ?

18.

SYSTEMS AND METHODS FOR PERFORMING DATA PROCESSING OPERATIONS USING VARIABLE LEVEL PARALLELISM

      
Application Number 18736974
Status Pending
Filing Date 2024-06-07
First Publication Date 2025-01-30
Owner Ab Initio Technology LLC (USA)
Inventor Dickie, Garth Allen

Abstract

Techniques for determining processing layouts to nodes of a dataflow graph. The techniques include: obtaining information specifying a dataflow graph, the dataflow graph comprising a plurality of nodes and a plurality of edges connecting the plurality nodes, the plurality of edges representing flows of data among nodes in the plurality of nodes, the plurality of nodes comprising: a first set of one or more nodes; and a second set of one or more nodes disjoint from the first set of nodes; obtaining a first set of one or more processing layouts for the first set of nodes; and determining a processing layout for each node in the second set of nodes based on the first set of processing layouts and one or more layout determination rules, the one or more layout determination rules including at least one rule for selecting among processing layouts having different degrees of parallelism, and information indicating that data generated by at least one node in the first and/or third set of nodes is not used by any nodes in the dataflow graph downstream from the at least one node.

IPC Classes  ?

  • G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]

19.

MACHINE LEARNING TECHNIQUES FOR DISCOVERING KEYS IN RELATIONAL DATASETS

      
Application Number 18783851
Status Pending
Filing Date 2024-07-25
First Publication Date 2025-01-30
Owner Ab Initio Technology LLC (USA)
Inventor
  • Chang, Andrew
  • Oberdorf, Richard
  • Wang, Song
  • Smith, Nathan

Abstract

Techniques for discovering primary, unique, and/or foreign keys for relational datasets are described. The techniques include profiling the relational datasets to obtain respective data profiles; identifying one or more primary key candidates for a first relational dataset using a first data profile of the first relational dataset and a first trained machine learning model; identifying one or more foreign key proposals for a second relational dataset using the one or more primary key candidates by performing a subset analysis of the second relational dataset with respect to the first relational dataset; identifying one or more foreign key candidates for the second relational dataset using the first data profile, a second data profile of the second relational dataset, and a second trained machine learning model different from the first trained machine learning model; and outputting the at primary key candidate(s) and the foreign key candidate(s).

IPC Classes  ?

  • G06F 16/21 - Design, administration or maintenance of databases
  • G06F 16/22 - IndexingData structures thereforStorage structures
  • G06F 40/30 - Semantic analysis

20.

FOR DATA OBJECTS MANAGED BY A DATA PROCESSING SYSTEM

      
Application Number 18909013
Status Pending
Filing Date 2024-10-08
First Publication Date 2025-01-23
Owner Ab Initio Technology LLC (USA)
Inventor
  • Parks, Robert
  • Yeracaris, Anthony
  • Radivojevic, Dusan

Abstract

Techniques for using finite state machines (FSMs) to implement workflows in a data processing system comprising at least one data store storing data objects and a workflow management system (WMS). The WMS is configured to perform: determining a current value of an attribute of a first data object by accessing the current value in the at least one data store; identifying, using the current value and metadata specifying relationships among at least some of the data objects, an actor authorized to perform a workflow task for the first data object; generating a GUI through which the actor can provide the input that the workflow task is to be performed; and in response to receiving, from the actor and through the GUI, input specifying that the workflow task is to be performed: performing the workflow task; and updating the current workflow state of the first FSM to a second workflow state.

IPC Classes  ?

  • G06Q 10/10 - Office automationTime management
  • G06F 9/448 - Execution paradigms, e.g. implementations of programming paradigms
  • G06F 16/906 - ClusteringClassification

21.

REUSABLE DATA PROCESSING PROGRAM GENERATION

      
Application Number US2024024224
Publication Number 2024/258483
Status In Force
Filing Date 2024-04-12
Publication Date 2024-12-19
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Weisman, Amit
  • Pribble, Molly
  • Keare, Avery
  • Stahlecker, Dirk
  • Abaya, Victor, T.
  • Angiolillo, Carl

Abstract

A method for developing a reusable data processing program including a set of data transformation steps by displaying a set of records and enabling a user to select one or more data transformation steps, applying the data transformation steps to the records, and displaying the transformed records.

IPC Classes  ?

  • G06F 16/901 - IndexingData structures thereforStorage structures
  • G06F 16/904 - BrowsingVisualisation therefor

22.

REUSABLE DATA PROCESSING PROGRAM GENERATION

      
Application Number 18633887
Status Pending
Filing Date 2024-04-12
First Publication Date 2024-12-12
Owner Ab Initio Technology LLC (USA)
Inventor
  • Weisman, Amit
  • Pribble, Molly
  • Keare, Avery
  • Stahlecker, Dirk
  • Abaya, Victor T.
  • Angiolillo, Carl

Abstract

A method for developing a reusable data processing program including a set of data transformation steps by displaying a set of records and iteratively enabling a user to select one or more data transformation steps, iteratively applying the data transformation steps to the records, and iteratively displaying the transformed records.

IPC Classes  ?

23.

MIGRATION OF DATASETS AMONG FEDERATED DATABASE SYSTEMS

      
Application Number US2024028621
Publication Number 2024/233801
Status In Force
Filing Date 2024-05-09
Publication Date 2024-11-14
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Bach, Edward, Alan
  • Eads, Matthew

Abstract

In an aspect, a method for migrating data records to a federated database system includes obtaining data records from a data source in a first federated database system; generating a data snapshot file based on the obtained data records and data indicative of a characteristic associated with the obtained data records; generating a hash of the data snapshot file to prevent modification of the data snapshot file; storing the data snapshot file and the generated hash in a data storage; migrating the obtained data records from the data snapshot file to a data target in a second federated database system, the migrating including: retrieving the data records from the data snapshot file stored in the data storage; providing the retrieved data records to the data target according to a mapping between a characteristic of the data source and a characteristic of the data target.

IPC Classes  ?

  • G06F 16/25 - Integrating or interfacing systems involving database management systems
  • G06F 16/27 - Replication, distribution or synchronisation of data between databases or within a distributed database systemDistributed database system architectures therefor

24.

Migration of datasets among federated database systems

      
Application Number 18442567
Grant Number 12346288
Status In Force
Filing Date 2024-02-15
First Publication Date 2024-11-14
Grant Date 2025-07-01
Owner Ab Initio Technology LLC (USA)
Inventor
  • Bach, Edward Alan
  • Eads, Matthew

Abstract

In an aspect, a method for migrating data records to a federated database system includes obtaining data records from a data source in a first federated database system; generating a data snapshot file based on the obtained data records and data indicative of a characteristic associated with the obtained data records; generating a hash of the data snapshot file to prevent modification of the data snapshot file; storing the data snapshot file and the generated hash in a data storage; migrating the obtained data records from the data snapshot file to a data target in a second federated database system, the migrating including: retrieving the data records from the data snapshot file stored in the data storage; providing the retrieved data records to the data target according to a mapping between a characteristic of the data source and a characteristic of the data target.

IPC Classes  ?

  • G06F 16/00 - Information retrievalDatabase structures thereforFile system structures therefor
  • G06F 16/11 - File system administration, e.g. details of archiving or snapshots
  • G06F 16/13 - File access structures, e.g. distributed indices
  • G06F 16/25 - Integrating or interfacing systems involving database management systems

25.

Meta-data driven ingestion and data processing

      
Application Number 18496543
Grant Number 12346350
Status In Force
Filing Date 2023-10-27
First Publication Date 2024-10-17
Grant Date 2025-07-01
Owner Ab Initio Technology LLC (USA)
Inventor
  • Isman, Marshall A.
  • Weiss, Adam
  • Egenolf, Jonah
  • Parks, Robert
  • Maclean, John
  • Mellon, Richard
  • Radivojevic, Dusan
  • Veiser, Paul
  • Khader, Mazin Woodrow

Abstract

A method implemented by a data processing system for enabling a system to pipeline or otherwise process data in conformance with specified criteria by providing a graphical user interface for selecting data to be processed, determining metadata of selected data, and, based on the metadata, automatically processing the selected data in conformance with the specified criteria.

IPC Classes  ?

  • G06F 16/20 - Information retrievalDatabase structures thereforFile system structures therefor of structured data, e.g. relational data
  • G06F 3/048 - Interaction techniques based on graphical user interfaces [GUI]
  • G06F 9/448 - Execution paradigms, e.g. implementations of programming paradigms
  • G06F 9/451 - Execution arrangements for user interfaces
  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
  • G06F 16/908 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content

26.

METADATA DRIVEN INGESTION AND DATA PROCESSING

      
Application Number US2024024348
Publication Number 2024/216088
Status In Force
Filing Date 2024-04-12
Publication Date 2024-10-17
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Isman, Marshall A.
  • Weiss, Adam
  • Egenolf, Jonah
  • Parks, Robert
  • Maclean, John
  • Mellon, Richard
  • Radivojevic, Dusan
  • Veiser, Paul
  • Khader, Mazin Woodrow

Abstract

A method implemented by a data processing system for enabling a system to pipeline or otherwise process data in conformance with specified criteria by providing a graphical user interface for selecting data to be processed, determining metadata of selected data, and, based on the metadata, automatically processing the selected data in conformance with the specified criteria.

IPC Classes  ?

  • G06F 16/21 - Design, administration or maintenance of databases

27.

Systems and methods for accessing data entities managed by a data processing system

      
Application Number 18415067
Grant Number 12254002
Status In Force
Filing Date 2024-01-17
First Publication Date 2024-10-17
Grant Date 2025-03-18
Owner Ab Initio Technology LLC (USA)
Inventor
  • Gylfason, Halldor Isak
  • Parks, Robert

Abstract

Techniques for obtaining information about data entity instances managed by a data processing system using at least one data store. The techniques include obtaining a query comprising a first portion comprising information for identifying instances of a first data entity stored in at least one data store; and a second portion indicating at least one attribute of the first data entity; generating, from the query, a plurality of executable queries including a first set of one or more executable queries and a second set of one or more executable queries, the generating comprising: generating, using the first portion, the first set of executable queries for identifying instances of the first data entity, and generating, using the second portion, the second set of executable queries for obtaining attribute values for instances of the first data entity; and executing the plurality of executable queries to obtain results for the query.

IPC Classes  ?

28.

On-demand integration of records with data catalog identifiers

      
Application Number 18491324
Grant Number 12339826
Status In Force
Filing Date 2023-10-20
First Publication Date 2024-10-10
Grant Date 2025-06-24
Owner Ab Initio Technology LLC (USA)
Inventor
  • Gould, Joel
  • Hunter, Matthew Jonathon

Abstract

A method implemented by a data processing system for enabling a user to browse a data catalog and select fields from multiple data sources to be integrated into a data profile so that, when a request is received for the data profile, data from those fields can be made available efficiently and immediately.

IPC Classes  ?

  • G06F 16/00 - Information retrievalDatabase structures thereforFile system structures therefor
  • G06F 16/21 - Design, administration or maintenance of databases
  • G06F 16/22 - IndexingData structures thereforStorage structures

29.

ON-DEMAND INTEGRATION OF RECORDS WITH DATA CATALOG IDENTIFIERS

      
Application Number US2024023227
Publication Number 2024/211674
Status In Force
Filing Date 2024-04-05
Publication Date 2024-10-10
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Gould, Joel
  • Hunter, Matthew Jonathon

Abstract

A method implemented by a data processing system for enabling a user to browse a data catalog and select fields of datasets from multiple data sources to be integrated into a data profile so that, when a request is received for the data profile, data from those fields can be made available efficiently and immediately.

IPC Classes  ?

  • G06F 16/242 - Query formulation
  • G06F 16/25 - Integrating or interfacing systems involving database management systems
  • G06F 16/901 - IndexingData structures thereforStorage structures
  • G06F 16/9035 - Filtering based on additional data, e.g. user or group profiles
  • G06F 16/9535 - Search customisation based on user profiles and personalisation

30.

Logical Access for Previewing Expanded View Datasets

      
Application Number 18492904
Status Pending
Filing Date 2023-10-24
First Publication Date 2024-09-26
Owner Ab Initio Technology LLC (USA)
Inventor
  • Parks, Robert
  • Egenolf, Jonah
  • Schechter, Ian

Abstract

A method implemented by a data processing system for: enabling a user to preview attributes of fields of an expanded view of a base dataset and to specify one or more of the fields to use in downstream data processing and generating a dataset that includes the one or more of the fields from the preview specified to be used in the downstream data processing, with the generated dataset having increased efficiency with respect to speed and data memory, relative to an efficiency of generating a dataset including all the fields of the expanded view when only the specified one or more of the fields are used in the downstream data processing.

IPC Classes  ?

31.

USING METADATA TO DRIVE BATCH AND REAL-TIME PROCESSING IN DATA PROCESSING SYSTEMS

      
Application Number US2024020140
Publication Number 2024/196759
Status In Force
Filing Date 2024-03-15
Publication Date 2024-09-26
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Isman, Marshall A.
  • Egenolf, Jonah

Abstract

Described are techniques for causing a data processing system to perform real-time decisioning by generating a data record (e.g., a dynamic data record) based on a request for the real-time decisioning, wherein the data record includes batch data and real-time data retrieved from one or more operational systems responsive to receipt of the request, with real-time being with regard to when the request is received by the data processing system.

IPC Classes  ?

  • G06F 16/21 - Design, administration or maintenance of databases

32.

LOGICAL ACCESS FOR PREVIEWING EXPANDED VIEW DATASETS

      
Application Number US2024021174
Publication Number 2024/197264
Status In Force
Filing Date 2024-03-22
Publication Date 2024-09-26
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Parks, Robert
  • Egenolf, Jonah
  • Schechter, Ian

Abstract

A method implemented by a data processing system for: enabling a user to preview attributes of fields of an expanded view of a base dataset and to specify one or more of the fields to use in downstream data processing and generating a dataset that includes the one or more of the fields from the preview specified to be used in the downstream data processing, with the generated dataset having increased efficiency with respect to speed and data memory, relative to an efficiency of generating a dataset including all the fields of the expanded view when only the specified one or more of the fields are used in the downstream data processing.

IPC Classes  ?

  • G06F 16/26 - Visual data miningBrowsing structured data
  • G06F 16/957 - Browsing optimisation, e.g. caching or content distillation
  • G06F 16/958 - Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

33.

Using Metadata to Drive Batch and Real-Time Processing in Data Processing Systems

      
Application Number 18491601
Status Pending
Filing Date 2023-10-20
First Publication Date 2024-09-19
Owner Ab Initio Technology LLC (USA)
Inventor
  • Isman, Marshall A.
  • Egenolf, Jonah

Abstract

Described are techniques for causing a data processing system to perform real-time decisioning by generating a record (e.g., dynamic record) based on a request for the real-time decisioning, wherein the record (e.g., dynamic record) includes batch data and real-time data retrieved from one or more operational systems responsive to receipt of the request, with real-time being with regard to when the request is received by the data processing system.

IPC Classes  ?

34.

SYSTEMS AND METHODS FOR DATAFLOW GRAPH OPTIMIZATION

      
Application Number 18670461
Status Pending
Filing Date 2024-05-21
First Publication Date 2024-09-19
Owner Ab Initio Technology LLC (USA)
Inventor Dickie, Garth Allen

Abstract

At least one non-transitory computer-readable storage medium storing processor-executable instructions that, when executed by at least one computer hardware processor, cause the at least one computer hardware processor to perform: obtaining an automatically generated initial dataflow graph, the initial dataflow graph comprising a first plurality of nodes representing a first plurality of data processing operations and a first plurality of links representing flows of data among nodes in the first plurality of nodes; and generating an updated dataflow graph by iteratively applying dataflow graph optimization rules to update the initial dataflow graph, the updated dataflow graph comprising a second plurality of nodes representing a second plurality of data processing operations and a second plurality of links representing flows of data among nodes in the second plurality of nodes.

IPC Classes  ?

35.

BACK-CALCULATION OF AGGREGATES

      
Application Number US2024014447
Publication Number 2024/167838
Status In Force
Filing Date 2024-02-05
Publication Date 2024-08-15
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Gould, Joel
  • Roche, Benjamin James Robbie

Abstract

Described are techniques for back-calculating one or more values of a new, real-time aggregate before sufficient data to calculate the new, real-time aggregate has been collected, wherein the back-calculating is based on data collected for one or more aggregates that have been executing prior to start of execution of the new, real-time aggregate.

IPC Classes  ?

  • G06F 17/18 - Complex mathematical operations for evaluating statistical data
  • G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]

36.

Processing Structured Data for Real-time Segmentation

      
Application Number 18486300
Status Pending
Filing Date 2023-10-13
First Publication Date 2024-08-08
Owner Ab Initio Technology LLC (USA)
Inventor
  • Gould, Joel
  • Roche, Benjamin James Robbie
  • Banks, Matthew John

Abstract

A method for performing real-time segmentation by updating a wide record based on receipt of real-time data, wherein an item of real-time data represents a transaction, detecting that the updated wide record satisfies criteria for performing real-time segmentation, and performing real-time segmentation on the updated, wide record, wherein real-time is relative to when a transaction represented in the updated wide record occurs.

IPC Classes  ?

  • G06F 16/25 - Integrating or interfacing systems involving database management systems
  • H04L 67/55 - Push-based network services

37.

Back-Calculation of Aggregates

      
Application Number 18233418
Status Pending
Filing Date 2023-08-14
First Publication Date 2024-08-08
Owner Ab Initio Technology LLC (USA)
Inventor
  • Gould, Joel
  • Roche, Benjamin James Robbie

Abstract

Described are techniques for back-calculating one or more values of a new, real-time aggregate before sufficient data to calculate the new, real-time aggregate has been collected, wherein the back-calculating is based on data collected for one or more aggregates that have been executing prior to start of execution of the new, real-time aggregate.

IPC Classes  ?

  • G06F 16/2458 - Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
  • G06F 16/242 - Query formulation

38.

PROCESSING STRUCTURED DATA FOR REAL-TIME SEGMENTATION

      
Application Number US2024014305
Publication Number 2024/163937
Status In Force
Filing Date 2024-02-02
Publication Date 2024-08-08
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Gould, Joel
  • Roche, Benjamin James Robbie
  • Banks, Matthew John

Abstract

A method for performing real-time segmentation by updating a wide record based on receipt of real-time data, wherein an item of real-time data represents a transaction, detecting that the updated wide record satisfies criteria for performing real-time segmentation, and performing real-time segmentation on the updated, wide record, wherein real-time is relative to when a transaction represented in the updated wide record occurs.

IPC Classes  ?

39.

ON-DEMAND RETRIEVAL OF STRUCTURED DATA IN AGGREGATING DATA ACROSS DISTINCT SOURCES

      
Application Number US2024012793
Publication Number 2024/158920
Status In Force
Filing Date 2024-01-24
Publication Date 2024-08-02
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor Gould, Joel

Abstract

A method for enabling a user to generate a complex aggregation on their own by providing the user with a graphical user interface that displays data items in a data catalog and that provides controls for the user to select data items to be used in generating the complex aggregation, and to select a type of aggregation, and based on the user's selections, automatically generating computer instructions to generate a value of the complex aggregation is described.

IPC Classes  ?

  • G06F 16/11 - File system administration, e.g. details of archiving or snapshots
  • G06F 16/16 - File or folder operations, e.g. details of user interfaces specifically adapted to file systems
  • G06F 8/34 - Graphical or visual programming
  • G06F 9/44 - Arrangements for executing specific programs

40.

DATA PROCESSING SYSTEM WITH MANIPULATION OF LOGICAL DATASET GROUPS

      
Application Number 18434546
Status Pending
Filing Date 2024-02-06
First Publication Date 2024-08-01
Owner Ab Initio Technology LLC (USA)
Inventor Weisman, Amit

Abstract

A data processing system that receives user input specifying datasets on which operations are performed with user interfaces that enable manipulation of hierarchical groups of datasets. A user interface may enable individual datasets or a previously defined group of datasets to be aggregated into another grouping. The groupings may be scoped, including by persona of users, such that, when a user is prompted to specify one or more datasets as a target of an operation by the data processing system, the available choices are limited to datasets that have a scope encompassing that user. The interfaces may prompt a user to select a grouping within the hierarchy that contains datasets on which the operation can be performed. Upon selection of a grouping with multiple datasets as a target of an operation that is performed on datasets singly, the operation may be performed on each dataset in the selected group.

IPC Classes  ?

  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
  • G06F 3/0482 - Interaction with lists of selectable items, e.g. menus
  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
  • G06F 16/248 - Presentation of query results

41.

On-Demand Retrieval of Structured Data in Aggregating Data Across Distinct Sources

      
Application Number 18338180
Status Pending
Filing Date 2023-06-20
First Publication Date 2024-07-25
Owner Ab Initio Technology LLC (USA)
Inventor Gould, Joel

Abstract

A method for enabling a user to generate a complex aggregation on their own by providing the user with a graphical user interface that displays data items in a data catalog and that provides controls for the user to select data items to be used in generating the complex aggregation, and to select a type of aggregation, and based on the user's selections, automatically generating computer instructions to generate a value of the complex aggregation is described.

IPC Classes  ?

  • G06F 16/248 - Presentation of query results
  • G06F 9/451 - Execution arrangements for user interfaces
  • G06F 16/2455 - Query execution
  • G06F 16/25 - Integrating or interfacing systems involving database management systems

42.

Display panel portion with an animated computer icon

      
Application Number 29831258
Grant Number D1032627
Status In Force
Filing Date 2022-03-18
First Publication Date 2024-06-25
Grant Date 2024-06-25
Owner Ab Initio Technology LLC (USA)
Inventor Nielsen, Bo Hjort

43.

Display panel portion with an animated computer icon

      
Application Number 29831275
Grant Number D1032628
Status In Force
Filing Date 2022-03-18
First Publication Date 2024-06-25
Grant Date 2024-06-25
Owner Ab Initio Technology LLC (USA)
Inventor Nielsen, Bo Hjort

44.

Display panel portion with an animated computer icon

      
Application Number 29831244
Grant Number D1031741
Status In Force
Filing Date 2022-03-18
First Publication Date 2024-06-18
Grant Date 2024-06-18
Owner Ab Initio Technology LLC (USA)
Inventor Nielsen, Bo Hjort

45.

Display panel portion with an animated computer icon

      
Application Number 29831176
Grant Number D1031740
Status In Force
Filing Date 2022-03-17
First Publication Date 2024-06-18
Grant Date 2024-06-18
Owner Ab Initio Technology LLC (USA)
Inventor Nielsen, Bo Hjort

46.

Generating rules for data processing values of data fields from semantic labels of the data fields

      
Application Number 18545416
Grant Number 12242444
Status In Force
Filing Date 2023-12-19
First Publication Date 2024-05-09
Grant Date 2025-03-04
Owner Ab Initio Technology LLC (USA)
Inventor
  • Joyce, John
  • Isman, Marshall A.
  • Melbouci, Sandrick

Abstract

Methods and systems are configured to determine a semantic meaning for data and generate data processing rules based on the semantic meaning of the data. The semantic meaning includes syntactical or contextual meaning for the data that is determined, for example, by profiling, by the data processing system, values stored in a field included in data records of one or more datasets; applying, by the data processing system, one or more classifiers to the profiled values; identifying, based on applying the one or more classifiers, one or more attributes indicative of a logical or syntactical characteristic for the values of the field, with each of the one or more attributes having a respective confidence level that is based on an output of each of the one or more classifiers. The attributes are associated with the fields and are used for generating data processing rules and processing the data.

IPC Classes  ?

  • G06F 16/00 - Information retrievalDatabase structures thereforFile system structures therefor
  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
  • G06F 16/22 - IndexingData structures thereforStorage structures
  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
  • G06N 5/04 - Inference or reasoning models
  • G06N 20/00 - Machine learning

47.

SYSTEMS AND METHODS FOR MANAGING PRIVILEGES IN A DATA PROCESSING SYSTEM

      
Application Number 18278264
Status Pending
Filing Date 2022-02-23
First Publication Date 2024-05-02
Owner Ab Initio Technology LLC (USA)
Inventor
  • Polstra, Drew
  • Parks, Robert

Abstract

Techniques for managing access privileges in a data processing system include obtaining a plurality of rules for granting and/or denying privileges to a first actor to perform at least one action on a first instance of a first data entity of data entities; identifying, from among attributes of the first data entity, a first attribute whose values are used by one or more of the plurality of rules; obtaining, from a user or from at least one data store, a first value of the first attribute; identifying, using the first value and from among the plurality of rules, a first rule that depends on the first value; generating a graphical user interface (GUI) including a visual rendering of at least some of the plurality of rules, the visual rendering emphasizing the first rule identified using the first value of the first attribute; and displaying the generated GUI to the user.

IPC Classes  ?

  • H04L 9/40 - Network security protocols
  • H04L 41/22 - Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks comprising specially adapted graphical user interfaces [GUI]

48.

Generating rules for data processing values of data fields from semantic labels of the data fields

      
Application Number 18399522
Grant Number 12242442
Status In Force
Filing Date 2023-12-28
First Publication Date 2024-04-18
Grant Date 2025-03-04
Owner Ab Initio Technology LLC (USA)
Inventor
  • Joyce, John
  • Isman, Marshall A.
  • Melbouci, Sandrick

Abstract

Methods and systems are configured to determine a semantic meaning for data and generate data processing rules based on the semantic meaning of the data. The semantic meaning includes syntactical or contextual meaning for the data that is determined, for example, by profiling, by the data processing system, values stored in a field included in data records of one or more datasets; applying, by the data processing system, one or more classifiers to the profiled values; identifying, based on applying the one or more classifiers, one or more attributes indicative of a logical or syntactical characteristic for the values of the field, with each of the one or more attributes having a respective confidence level that is based on an output of each of the one or more classifiers. The attributes are associated with the fields and are used for generating data processing rules and processing the data.

IPC Classes  ?

  • G06F 16/00 - Information retrievalDatabase structures thereforFile system structures therefor
  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
  • G06F 16/22 - IndexingData structures thereforStorage structures
  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
  • G06N 5/04 - Inference or reasoning models
  • G06N 20/00 - Machine learning

49.

Generating rules for data processing values of data fields from semantic labels of the data fields

      
Application Number 18399545
Grant Number 12242443
Status In Force
Filing Date 2023-12-28
First Publication Date 2024-04-18
Grant Date 2025-03-04
Owner Ab Initio Technology LLC (USA)
Inventor
  • Joyce, John
  • Isman, Marshall A.
  • Melbouci, Sandrick

Abstract

Methods and systems are configured to determine a semantic meaning for data and generate data processing rules based on the semantic meaning of the data. The semantic meaning includes syntactical or contextual meaning for the data that is determined, for example, by profiling, by the data processing system, values stored in a field included in data records of one or more datasets; applying, by the data processing system, one or more classifiers to the profiled values; identifying, based on applying the one or more classifiers, one or more attributes indicative of a logical or syntactical characteristic for the values of the field, with each of the one or more attributes having a respective confidence level that is based on an output of each of the one or more classifiers. The attributes are associated with the fields and are used for generating data processing rules and processing the data.

IPC Classes  ?

  • G06F 16/00 - Information retrievalDatabase structures thereforFile system structures therefor
  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
  • G06F 16/22 - IndexingData structures thereforStorage structures
  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
  • G06N 5/04 - Inference or reasoning models
  • G06N 20/00 - Machine learning

50.

Dynamic distribution of container images

      
Application Number 18492173
Grant Number 12307262
Status In Force
Filing Date 2023-10-23
First Publication Date 2024-04-18
Grant Date 2025-05-20
Owner Ab Initio Technology LLC (USA)
Inventor
  • Lynch, Frank
  • Wakeling, Tim

Abstract

A method implemented by a data processing system including: accessing the container image that includes the first application and a second application; determining, by the data processing system, the number of parallel executions of the given module of the first application; for the given module, generating a plurality of instances of the container image in accordance with the number of parallel executions determined, for each instance, configuring that instance to execute the given module of the first application; causing each of the plurality of configured instances to execute on one or more of the host systems; and for at least one of the plurality of configured instances, causing, by the second application of that configured instance, communication between the data processing system and the one or more of the host systems executing that configured instance.

IPC Classes  ?

  • G06F 9/445 - Program loading or initiating
  • G06F 8/61 - Installation
  • G06F 9/451 - Execution arrangements for user interfaces
  • G06F 9/455 - EmulationInterpretationSoftware simulation, e.g. virtualisation or emulation of application or operating system execution engines
  • G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]

51.

Generation of optimized logic from a schema

      
Application Number 18345852
Grant Number 12346312
Status In Force
Filing Date 2023-06-30
First Publication Date 2024-04-18
Grant Date 2025-07-01
Owner Ab Initio Technology LLC (USA)
Inventor
  • Egenolf, Jonah
  • Isman, Marshall A.
  • Schechter, Ian

Abstract

A method includes accessing a schema that specifies relationships among datasets, computations on the datasets, or transformations of the datasets, selecting a dataset from among the datasets, and identifying, from the schema, other datasets that are related to the selected dataset. Attributes of the datasets are identified, and logical data representing the identified attributes and relationships among the attributes is generated. The logical data is provided to a development environment, which provides access to portions of the logical data representing the identified attributes. A specification that specifies at least one of the identified attributes in performing an operation is received from the development environment. Based on the specification and the relationships among the identified attributes represented by the logical data, a computer program is generated to perform the operation by accessing, from storage, at least one dataset having the at least one of the attributes specified in the specification.

IPC Classes  ?

  • G06F 8/34 - Graphical or visual programming
  • G06F 8/36 - Software reuse
  • G06F 8/38 - Creation or generation of source code for implementing user interfaces
  • G06F 16/21 - Design, administration or maintenance of databases
  • G06F 16/23 - Updating
  • G06F 16/242 - Query formulation
  • G06F 16/2452 - Query translation
  • G06F 16/2453 - Query optimisation
  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
  • G06F 16/81 - Indexing, e.g. XML tagsData structures thereforStorage structures
  • G06F 16/901 - IndexingData structures thereforStorage structures
  • G06Q 10/10 - Office automationTime management
  • G06Q 30/0242 - Determining effectiveness of advertisements

52.

TECHNIQUES FOR DISCOVERING AND UPDATING SEMANTIC MEANING OF DATA FIELDS

      
Application Number US2023074617
Publication Number 2024/064705
Status In Force
Filing Date 2023-09-19
Publication Date 2024-03-28
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Joyce, John
  • Huang, David
  • Chang, Andrew
  • Morrison, Niel

Abstract

Techniques for discovering semantic meaning of data in fields included in one or more data sets, the method including: a first field having a previously-assigned label that indicates a semantic meaning of the first field; identifying a set of one or more candidate labels, for potential assignment to the first field instead of the previously-assigned label; evaluating, using a previously-determined label score and a first candidate label score, whether to assign a first candidate label to the first field, the evaluating comprising: when the first candidate label score is at least a first threshold amount greater than a previously- determined label score, presenting the first candidate label to a user by generating an interface through which the user can provide input indicating whether to assign the first candidate label to the first field instead of the previously-determined label.

IPC Classes  ?

  • G06F 16/21 - Design, administration or maintenance of databases
  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
  • G06F 21/62 - Protecting access to data via a platform, e.g. using keys or access control rules

53.

TECHNIQUES FOR DISCOVERING AND UPDATING SEMANTIC MEANING OF DATA FIELDS

      
Document Number 03268252
Status Pending
Filing Date 2023-09-19
Open to Public Date 2024-03-28
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Joyce, John
  • Huang, David
  • Chang, Andrew
  • Morrison, Niel

IPC Classes  ?

  • G06F 16/21 - Design, administration or maintenance of databases
  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
  • G06F 21/62 - Protecting access to data via a platform, e.g. using keys or access control rules

54.

Publishing to a data warehouse

      
Application Number 18492425
Grant Number 12277135
Status In Force
Filing Date 2023-10-23
First Publication Date 2024-03-28
Grant Date 2025-04-15
Owner Ab Initio Technology LLC (USA)
Inventor
  • Blom, Andrew
  • Miller, Darren
  • Isman, Marshall A.

Abstract

A method for generating an executable application to transform and load data into a structured dataset includes receiving a metadata file that specifies values for parameters for structuring data feeds, received from a networked data source, into a structured database. The metadata file specifies logical rules for transforming the data feeds. The values of the parameters and the logical rules for transforming the plurality of the data feeds are validated to ensure logical consistency for each data feed. Data rules are generated that specify standards for transforming each data feed in accordance with the validated values of the parameters and logical rules. The executable application is generated that is configured to receive source data comprising a data feed from one or more data sources and transform the source data into structured data that satisfies the one or more standards for the structured data record in compliance with the data rules.

IPC Classes  ?

  • G06F 7/00 - Methods or arrangements for processing data by operating upon the order or content of the data handled
  • G06F 8/34 - Graphical or visual programming
  • G06F 16/25 - Integrating or interfacing systems involving database management systems
  • G06F 16/901 - IndexingData structures thereforStorage structures
  • G06F 17/00 - Digital computing or data processing equipment or methods, specially adapted for specific functions
  • H04L 67/565 - Conversion or adaptation of application format or content

55.

Techniques for discovering and updating semantic meaning of data fields

      
Application Number 18470405
Grant Number 12141107
Status In Force
Filing Date 2023-09-19
First Publication Date 2024-03-21
Grant Date 2024-11-12
Owner Ab Initio Technology LLC (USA)
Inventor
  • Joyce, John
  • Huang, David
  • Chang, Andrew
  • Morrison, Niel

Abstract

Techniques for discovering semantic meaning of data in fields included in one or more data sets, the method including: a first field having a previously-assigned label that indicates a semantic meaning of the first field; identifying a set of one or more candidate labels, for potential assignment to the first field instead of the previously-assigned label; evaluating, using a previously-determined label score and a first candidate label score, whether to assign a first candidate label to the first field, the evaluating comprising: when the first candidate label score is at least a first threshold amount greater than a previously-determined label score, presenting the first candidate label to a user by generating an interface through which the user can provide input indicating whether to assign the first candidate label to the first field instead of the previously-determined label.

IPC Classes  ?

  • G06F 16/00 - Information retrievalDatabase structures thereforFile system structures therefor
  • G06F 16/21 - Design, administration or maintenance of databases
  • G06F 40/30 - Semantic analysis

56.

TECHNIQUES FOR PRE-ASSIGNMENT VALIDATION OF DATA MANAGED BY A DATA PROCESSING SYSTEM

      
Application Number US2023070643
Publication Number 2024/054726
Status In Force
Filing Date 2023-07-20
Publication Date 2024-03-14
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Clemens, David
  • Procops, Roy

Abstract

Some embodiments provide techniques of enforcing valid data assignments in a data processing system in which data can be dynamically updated by user devices and/or computerized processes. The techniques identify, using a validation rule associated with a data entity, one or more valid values for assignment to an attribute of an instance of the data entity. The techniques identify the valid value(s) by generating a query for the one or more valid values using one or more condition(s) on the attribute in the validation rule, and executing the generated query to obtain the one or more valid values for the first attribute. The attribute may then be assigned one or more of the identified valid value(s).

IPC Classes  ?

  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors

57.

TECHNIQUES FOR PRE-ASSIGNMENT VALIDATION OF DATA MANAGED BY A DATA PROCESSING SYSTEM

      
Application Number 18356153
Status Pending
Filing Date 2023-07-20
First Publication Date 2024-03-14
Owner Ab Initio Technology LLC (USA)
Inventor
  • Clemens, David
  • Procops, Roy

Abstract

Some embodiments provide techniques of enforcing valid data assignments in a data processing system in which data can be dynamically updated by user devices and/or computerized processes. The techniques identify, using a validation rule associated with a data entity, one or more valid values for assignment to an attribute of an instance of the data entity. The techniques identify the valid value(s) by generating a query for the one or more valid values using one or more condition(s) on the attribute in the validation rule, and executing the generated query to obtain the one or more valid values for the first attribute. The attribute may then be assigned one or more of the identified valid value(s).

IPC Classes  ?

  • G06F 16/2457 - Query processing with adaptation to user needs
  • G06F 16/2453 - Query optimisation
  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models

58.

GENERATING, ACCESSING, AND DISPLAYING LINEAGE METADATA

      
Application Number 18345706
Status Pending
Filing Date 2023-06-30
First Publication Date 2024-03-07
Owner Ab Initio Technology LLC (USA)
Inventor
  • Clemens, David
  • Radivojevic, Dusan
  • Galarneau, Neil

Abstract

Among other things, we describe a method of receiving a portion of metadata from a data source, the portion of metadata describing nodes and edges; generating instances of a data structure representing the portion of metadata, at least one instance of the data structure including an identification value that identifies a corresponding node, one or more property values representing respective properties of the corresponding node, and one or more pointers to respective identification values, each pointer representing an edge associated with a node identified by the corresponding respective identification value; storing the instances of the data structure in random access memory; receiving a query that includes an identification of at least one particular element of data; and using at least one instance of the data structure to cause a display of a computer system to display a representation of lineage of the particular element of data.

IPC Classes  ?

59.

OPERATIONALIZING METADATA

      
Application Number US2023029810
Publication Number 2024/044041
Status In Force
Filing Date 2023-08-09
Publication Date 2024-02-29
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Parks, Robert
  • Egenolf, Jonah

Abstract

A method for using a metadata model to perform operations on data items, with the metadata model including parent nodes and child nodes connected by edges, with the parent nodes specifying logical metadata and the child nodes specifying physical metadata representing the data items, and with the edges specifying relationships between the nodes. The method includes: identifying a given data item and physical metadata of that given data item, accessing the metadata model, identifying, in the metadata model, a child node representing the physical metadata of the given data item, traversing one or more edges in the metadata model to identify parent nodes of the child node, determining, from logical metadata associated with the identified parent nodes, one or more operations to be performed on the given data item, applying the one or more operations to the given data item to transform the data item, and storing the transformed data item.

IPC Classes  ?

  • G06F 16/901 - IndexingData structures thereforStorage structures
  • G06F 16/907 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually

60.

OPERATIONALIZING METADATA

      
Document Number 03265601
Status Pending
Filing Date 2023-08-09
Open to Public Date 2024-02-29
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Parks, Robert
  • Egenolf, Jonah

IPC Classes  ?

  • G06F 16/901 - IndexingData structures thereforStorage structures
  • G06F 16/907 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually

61.

OPERATIONALIZING METADATA

      
Application Number 18104066
Status Pending
Filing Date 2023-01-31
First Publication Date 2024-02-29
Owner Ab Initio Technology LLC (USA)
Inventor
  • Parks, Robert
  • Egenolf, Jonah

Abstract

A method for using a metadata model to perform operations on data items, with the metadata model including parent nodes and child nodes connected by edges, with the parent nodes specifying logical metadata and the child nodes specifying physical metadata representing the data items, and with the edges specifying relationships between the nodes. The method includes: identifying a given data item and physical metadata of that given data item, accessing the metadata model, identifying, in the metadata model, a child node representing the physical metadata of the given data item, traversing one or more edges in the metadata model to identify parent nodes of the child node, determining, from logical metadata associated with the identified parent nodes, one or more operations to be performed on the given data item, applying the one or more operations to the given data item to transform the data item, and storing the transformed data item.

IPC Classes  ?

  • G06F 16/25 - Integrating or interfacing systems involving database management systems
  • G06F 16/26 - Visual data miningBrowsing structured data
  • G06F 16/901 - IndexingData structures thereforStorage structures

62.

Editor for generating computational graphs

      
Application Number 18112958
Grant Number 12050606
Status In Force
Filing Date 2023-02-22
First Publication Date 2024-01-25
Grant Date 2024-07-30
Owner Ab Initio Technology LLC (USA)
Inventor
  • Schechter, Ian
  • Dickie, Garth

Abstract

Techniques for generating a dataflow graph include generating a first dataflow graph with a plurality of first nodes representing first computer operations in processing data, with at least one of the first computer operations being a declarative operation that specifies one or more characteristics of one or more results of processing of data, and transforming the first dataflow graph into a second dataflow graph for processing data in accordance with the first computer operations, the second dataflow graph including a plurality of second nodes representing second computer operations, with at least one of the second nodes representing one or more imperative operations that implement the logic specified by the declarative operation, where the one or more imperative operations are unrepresented by the first nodes in the first dataflow graph.

IPC Classes  ?

  • G06F 16/2455 - Query execution
  • G06F 9/30 - Arrangements for executing machine instructions, e.g. instruction decode
  • G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]
  • G06F 16/242 - Query formulation
  • G06F 16/2457 - Query processing with adaptation to user needs
  • G06F 16/25 - Integrating or interfacing systems involving database management systems
  • G06F 16/901 - IndexingData structures thereforStorage structures

63.

DISCOVERING A SEMANTIC MEANING OF DATA FIELDS FROM PROFILE DATA OF THE DATA FIELDS

      
Application Number 18201545
Status Pending
Filing Date 2023-05-24
First Publication Date 2023-12-21
Owner Ab Initio Technology LLC (USA)
Inventor
  • Butler, Christopher Thurston
  • Bush, Timothy Spencer

Abstract

A data processing system for discovering a semantic meaning of a field included in one or more data sets is configured to identify a field included in one or more data sets, with the field having an identifier. For that field, the system profiles data values of the field to generate a data profile, accesses a plurality of label proposal tests, and generates a set of label proposals by applying the plurality of label proposal tests to the data profile. The system determines a similarity among the label proposals and selects a classification. The system identifies one of the label proposals as identifying the semantic meaning. The system stores the identifier of the field with the identified one of the label proposals that identifies the semantic meaning.

IPC Classes  ?

  • G06F 40/30 - Semantic analysis
  • G06F 16/93 - Document management systems
  • G06N 20/00 - Machine learning
  • G06F 16/908 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content

64.

DATAFLOW GRAPH DATASETS

      
Document Number 03256554
Status Pending
Filing Date 2023-02-24
Open to Public Date 2023-11-09
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Schechter, Ian Robert
  • Dickie, Garth Allen
  • Egenolf, Jonah
  • Isman, Marshall

IPC Classes  ?

65.

DATAFLOW GRAPH DATASETS

      
Application Number 18114212
Status Pending
Filing Date 2023-02-24
First Publication Date 2023-11-09
Owner Ab Initio Technology LLC (USA)
Inventor
  • Schechter, Ian Robert
  • Dickie, Garth Allen
  • Egenolf, Jonah
  • Isman, Marshall

Abstract

Described herein are techniques, performed by a data processing system, for enabling efficient development of software application programs in a dynamic environment with multiple datasets by generating entries in a dataset catalog to provide a software application program with access to output data dynamically generated by dataflow graphs, the entries associated with respective software application programs developed as dataflow graphs. The techniques include identifying a subgraph, wherein, when the subgraph is executed, the subgraph generates output data by applying one or more data processing operations to data obtained from one or more data sources; creating, in the dataset catalog, a new entry associated with the identified subgraph, the new entry associated with information indicating nodes, links, and configuration parameters of the identified subgraph; and configuring the dataset catalog to enable access to the new entry, in the dataset catalog, associated with the identified subgraph.

IPC Classes  ?

  • G06F 16/901 - IndexingData structures thereforStorage structures

66.

DATAFLOW GRAPH DATASETS

      
Application Number US2023013841
Publication Number 2023/215028
Status In Force
Filing Date 2023-02-24
Publication Date 2023-11-09
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Schechter, Ian, Robert
  • Dickie, Garth, Allen
  • Egenolf, Jonah
  • Isman, Marshall

Abstract

Described herein are techniques, performed by a data processing system, for enabling efficient development of software application programs in a dynamic environment with multiple datasets by generating entries in a dataset catalog to provide a software application program with access to output data dynamically generated by dataflow graphs, the entries associated with respective software application programs developed as dataflow graphs. The techniques include identifying a subgraph, wherein, when the subgraph is executed, the subgraph generates output data by applying one or more data processing operations to data obtained from one or more data sources; creating, in the dataset catalog, a new entry associated with the identified subgraph, the new entry associated with information indicating nodes, links, and configuration parameters of the identified subgraph; and configuring the dataset catalog to enable access to the new entry, in the dataset catalog, associated with the identified subgraph.

IPC Classes  ?

67.

Dynamic execution of parameterized applications for the processing of keyed network data streams

      
Application Number 18316333
Grant Number 11983548
Status In Force
Filing Date 2023-05-12
First Publication Date 2023-09-07
Grant Date 2024-05-14
Owner Ab Initio Technology LLC (USA)
Inventor
  • Ravid, Oded
  • Murphy, Trevor

Abstract

A method is described for processing keyed data items that are each associated with a value of a key, the keyed data items being from a plurality of distinct data streams, the processing including collecting the keyed data items, determining, based on contents of at least one of the keyed data items, satisfaction of one or more specified conditions for execution of one or more actions and causing execution of at least one of the one or more actions responsive to the determining.

IPC Classes  ?

  • G06F 7/00 - Methods or arrangements for processing data by operating upon the order or content of the data handled
  • G06F 9/445 - Program loading or initiating
  • G06F 9/448 - Execution paradigms, e.g. implementations of programming paradigms
  • G06F 16/178 - Techniques for file synchronisation in file systems
  • G06F 16/2455 - Query execution
  • G06F 16/901 - IndexingData structures thereforStorage structures
  • G06F 8/41 - Compilation

68.

CHARACTERIZING DATA SOURCES IN A DATA STORAGE SYSTEM

      
Application Number 17860568
Status Pending
Filing Date 2022-07-08
First Publication Date 2023-06-01
Owner Ab Initio Technology LLC (USA)
Inventor Anderson, Arlen

Abstract

Characterizing data includes: reading data from an interface to a data storage system, and storing two or more sets of summary data summarizing data stored in different respective data sources in the data storage system; and processing the stored sets of summary data to generate system information characterizing data from multiple data sources in the data storage system. The processing includes: analyzing the stored sets of summary data to select two or more data sources that store data satisfying predetermined criteria, and generating the system information including information identifying a potential relationship between fields of records included in different data sources based at least in part on comparison between values from a stored set of summary data summarizing a first of the selected data sources and values from a stored set of summary data summarizing a second of the selected data sources.

IPC Classes  ?

  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
  • G06F 16/23 - Updating

69.

AUTOMATED MODIFICATION OF COMPUTER PROGRAMS

      
Document Number 03233392
Status Pending
Filing Date 2022-10-06
Open to Public Date 2023-04-13
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Joyce, John
  • Isman, Marshall A.
  • Bach, Edward Alan

Abstract

A method for updating a computer program includes receiving a computer program hosted on and configured to be executed by a first computing system. The method includes analyzing the computer program to obtain characterization of a lineage, an architecture, and an operation of the computer program. The lineage includes relationships among elements of the computer program, the architecture includes a characteristic of the data source, the data target, and one or more processors configured to process the data contained in data records, and the operation includes processes that are executed to process the data from the data records. The method includes receiving a characterization of an update to be made to the computer program, in which when the computer program is modified according to the update, at least some of the modified computer program is configured to be hosted on and executed by a second computing system; and modifying the computer program to implement the update to generate the modified computer program.

IPC Classes  ?

  • G06F 8/76 - Adapting program code to run in a different environmentPorting

70.

AUTOMATED MODIFICATION OF COMPUTER PROGRAMS

      
Application Number US2022077666
Publication Number 2023/060170
Status In Force
Filing Date 2022-10-06
Publication Date 2023-04-13
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Joyce, John
  • Isman, Marshall A.
  • Bach, Edward Alan

Abstract

A method for updating a computer program includes receiving a computer program hosted on and configured to be executed by a first computing system. The method includes analyzing the computer program to obtain characterization of a lineage, an architecture, and an operation of the computer program. The lineage includes relationships among elements of the computer program, the architecture includes a characteristic of the data source, the data target, and one or more processors configured to process the data contained in data records, and the operation includes processes that are executed to process the data from the data records. The method includes receiving a characterization of an update to be made to the computer program, in which when the computer program is modified according to the update, at least some of the modified computer program is configured to be hosted on and executed by a second computing system; and modifying the computer program to implement the update to generate the modified computer program.

IPC Classes  ?

  • G06F 8/76 - Adapting program code to run in a different environmentPorting

71.

AUTOMATED MODIFICATION OF COMPUTER PROGRAMS

      
Application Number 17704469
Status Pending
Filing Date 2022-03-25
First Publication Date 2023-04-13
Owner Ab Initio Technology LLC (USA)
Inventor
  • Joyce, John
  • Isman, Marshall A.
  • Bach, Edward Alan

Abstract

A method for updating a computer program includes receiving a computer program hosted on and configured to be executed by a first computing system. The method includes analyzing the computer program to obtain characterization of a lineage, an architecture, and an operation of the computer program. The lineage includes relationships among elements of the computer program, the architecture includes a characteristic of the data source, the data target, and one or more processors configured to process the data contained in data records, and the operation includes processes that are executed to process the data from the data records. The method includes receiving a characterization of an update to be made to the computer program, in which when the computer program is modified according to the update, at least some of the modified computer program is configured to be hosted on and executed by a second computing system; and modifying the computer program to implement the update to generate the modified computer program.

IPC Classes  ?

  • G06F 8/65 - Updates
  • G06F 16/27 - Replication, distribution or synchronisation of data between databases or within a distributed database systemDistributed database system architectures therefor
  • H04L 67/06 - Protocols specially adapted for file transfer, e.g. file transfer protocol [FTP]
  • H04L 67/00 - Network arrangements or protocols for supporting network services or applications

72.

SYSTEMS AND METHODS FOR PERFORMING DATA PROCESSING OPERATIONS USING VARIABLE LEVEL PARALLELISM

      
Application Number US2022045358
Publication Number 2023/056003
Status In Force
Filing Date 2022-09-30
Publication Date 2023-04-06
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor Dickie, Garth, Allen

Abstract

Techniques for determining processing layouts to nodes of a dataflow graph. The techniques include: obtaining information specifying a dataflow graph, the dataflow graph comprising a plurality of nodes and a plurality of edges connecting the plurality nodes, the plurality of edges representing flows of data among nodes in the plurality of nodes, the plurality of nodes comprising: a first set of one or more nodes; and a second set of one or more nodes disjoint from the first set of nodes; obtaining a first set of one or more processing layouts for the first set of nodes; and determining a processing layout for each node in the second set of nodes based on the first set of processing layouts and one or more layout determination rules, the one or more layout determination rules including at least one rule for selecting among processing layouts having different degrees of parallelism, and information indicating that data generated by at least one node in the first and/or third set of nodes is not used by any nodes in the dataflow graph downstream from the at least one node.

IPC Classes  ?

  • G06F 9/48 - Program initiatingProgram switching, e.g. by interrupt
  • G06F 8/41 - Compilation
  • G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]

73.

SYSTEMS AND METHODS FOR PERFORMING DATA PROCESSING OPERATIONS USING VARIABLE LEVEL PARALLELISM

      
Application Number 17957646
Status Pending
Filing Date 2022-09-30
First Publication Date 2023-03-30
Owner
  • AB INITIO SOFTWARE LLC (USA)
  • AB INITIO ORIGINAL WORKS LLC (USA)
  • AB INITIO TECHNOLOGY LLC (USA)
Inventor Dickie, Garth Allen

Abstract

Techniques for determining processing layouts to nodes of a dataflow graph. The techniques include: obtaining information specifying a dataflow graph, the dataflow graph comprising a plurality of nodes and a plurality of edges connecting the plurality nodes, the plurality of edges representing flows of data among nodes in the plurality of nodes, the plurality of nodes comprising: a first set of one or more nodes; and a second set of one or more nodes disjoint from the first set of nodes; obtaining a first set of one or more processing layouts for the first set of nodes; and determining a processing layout for each node in the second set of nodes based on the first set of processing layouts and one or more layout determination rules, the one or more layout determination rules including at least one rule for selecting among processing layouts having different degrees of parallelism, and information indicating that data generated by at least one node in the first and/or third set of nodes is not used by any nodes in the dataflow graph downstream from the at least one node.

IPC Classes  ?

  • G06F 16/248 - Presentation of query results
  • G06F 16/22 - IndexingData structures thereforStorage structures

74.

METADATA-DRIVEN DATA INGESTION

      
Application Number 17665109
Status Pending
Filing Date 2022-02-04
First Publication Date 2023-03-30
Owner Ab Initio Technology LLC (USA)
Inventor
  • Radivojevic, Dusan
  • Parks, Robert
  • Weiss, Adam
  • Jankovic, Maja
  • Vickery, John

Abstract

An electronic system for increasing the speed of preparing data with a specified data quality for storage by automatically identifying for a user, with minimal user input, common contexts among (i) fields in disparate datasets, and (ii) names the user has specified as potentially describing the fields, and by using those common contexts to govern the disparate datasets prior to storage to ensure the specified data quality.

IPC Classes  ?

  • G06F 3/06 - Digital input from, or digital output to, record carriers

75.

METADATA-DRIVEN DATA INGESTION

      
Document Number 03230753
Status Pending
Filing Date 2022-09-16
Open to Public Date 2023-03-23
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Radivojevic, Dusan
  • Parks, Robert
  • Weiss, Adam
  • Jankovic, Maja
  • Vickery, John

Abstract

An electronic system for increasing the speed of preparing data with a specified data quality for storage by automatically identifying for a user, with minimal user input, common contexts among (i) fields in disparate datasets, and (ii) names the user has specified as potentially describing the fields, and by using those common contexts to govern the disparate datasets prior to storage to ensure the specified data quality.

IPC Classes  ?

  • G06F 16/25 - Integrating or interfacing systems involving database management systems

76.

METADATA-DRIVEN DATA INGESTION

      
Application Number US2022076595
Publication Number 2023/044445
Status In Force
Filing Date 2022-09-16
Publication Date 2023-03-23
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Radivojevic, Dusan
  • Parks, Robert
  • Weiss, Adam
  • Jankovic, Maja
  • Vickery, John

Abstract

An electronic system for increasing the speed of preparing data with a specified data quality for storage by automatically identifying for a user, with minimal user input, common contexts among (i) fields in disparate datasets, and (ii) names the user has specified as potentially describing the fields, and by using those common contexts to govern the disparate datasets prior to storage to ensure the specified data quality.

IPC Classes  ?

  • G06F 16/25 - Integrating or interfacing systems involving database management systems

77.

Database Security

      
Application Number 17987387
Status Pending
Filing Date 2022-11-15
First Publication Date 2023-03-09
Owner Ab Initio Technology LLC (USA)
Inventor Winters, Christopher J.

Abstract

A method includes automatically determining a component of a security label for each first record in a first table of a database having multiple tables, including: identifying a second record related to the first record according to a foreign key relationship; identifying a component of the security label for the second record; and assigning a value for the component of the security label for the first record based on the identified component of the security label for the second record. The method includes storing the determined security label in the record.

IPC Classes  ?

  • G06F 21/62 - Protecting access to data via a platform, e.g. using keys or access control rules
  • G06F 21/60 - Protecting data
  • G06F 16/23 - Updating
  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models

78.

DATA GOVERNANCE SYSTEMS AND METHODS

      
Document Number 03223528
Status Pending
Filing Date 2022-06-07
Open to Public Date 2022-12-22
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Franquin, Pierre
  • Krigelman, Ken
  • Schon, Andy
  • Voshell, Justin

Abstract

Some embodiments relate to a method for use in connection with governance of a plurality of data assets managed by a data processing system, the method comprising: using at least one computer hardware processor to perform: accessing a data governance policy comprising a first data standard (e.g., by obtaining information about the first standard stored in a database system); generating a first data asset collection at least in part by automatically selecting, from among the plurality of data assets managed by the data processing system and using at least one data asset criterion, one or more data assets that meet the at least one data asset criterion; associating the first data asset collection with the first data standard; and verifying whether at least one of the one or more data assets in the first data asset collection complies with the first data standard.

IPC Classes  ?

  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors

79.

DATA GOVERNANCE SYSTEMS AND METHODS

      
Application Number US2022032485
Publication Number 2022/265884
Status In Force
Filing Date 2022-06-07
Publication Date 2022-12-22
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Franquin, Pierre
  • Krigelman, Ken
  • Schon, Andy
  • Voshell, Justin

Abstract

Some embodiments relate to a method for use in connection with governance of a plurality of data assets managed by a data processing system, the method comprising: using at least one computer hardware processor to perform: accessing a data governance policy comprising a first data standard (e.g., by obtaining information about the first standard stored in a database system); generating a first data asset collection at least in part by automatically selecting, from among the plurality of data assets managed by the data processing system and using at least one data asset criterion, one or more data assets that meet the at least one data asset criterion; associating the first data asset collection with the first data standard; and verifying whether at least one of the one or more data assets in the first data asset collection complies with the first data standard.

IPC Classes  ?

  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors

80.

Data governance systems and methods

      
Application Number 17834492
Grant Number 12287893
Status In Force
Filing Date 2022-06-07
First Publication Date 2022-12-15
Grant Date 2025-04-29
Owner Ab Initio Technology LLC (USA)
Inventor
  • Franquin, Pierre
  • Krigelman, Ken
  • Schon, Andrew
  • Voshell, Justin

Abstract

Some embodiments relate to a method for use in connection with governance of a plurality of data assets managed by a data processing system, the method comprising: using at least one computer hardware processor to perform: accessing a data governance policy comprising a first data standard (e.g., by obtaining information about the first standard stored in a database system); generating a first data asset collection at least in part by automatically selecting, from among the plurality of data assets managed by the data processing system and using at least one data asset criterion, one or more data assets that meet the at least one data asset criterion; associating the first data asset collection with the first data standard; and verifying whether at least one of the one or more data assets in the first data asset collection complies with the first data standard.

IPC Classes  ?

  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
  • G06F 16/2455 - Query execution
  • G06F 21/62 - Protecting access to data via a platform, e.g. using keys or access control rules

81.

DYNAMIC COMPUTATIONAL RESOURCE ALLOCATION

      
Document Number 03217116
Status Pending
Filing Date 2022-06-03
Open to Public Date 2022-12-08
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Schmidt, Stephen J.
  • Ricardo, David
  • Stanfill, Craig W.
  • Chu, Jacky Cheuk Kei
  • Vishniac, Ephraim Meriwether

Abstract

A method for performing a distributed computation on a computing system using computational resources dynamically allocated using a computational resource manager includes storing information specifying quantities of computational resources associated with respective ones of a number of program portions of the program, where the program portions perform successive transformations of data and each program portion uses computational resources granted by the computational resource manager enabling computation associated with that program portion to be performed in the computing system, requesting a first quantity of computational resources associated with a first program portion of the number of program portions from the computational resource manager, receiving a second quantity of computational resources from the computational resource manager, less than the requested first quantity of computational resources, performing computation associated with the first portion of the program using the second quantity of computational resources, while performing the computation associated with the first portion of the program using the second quantity of computational resources, receiving an additional quantity of computational resources from the computational resource manager, and performing an additional computation associated with the first portion of the program using the additional quantity of computational resources while performing the computation associated with the first portion using the second quantity of computational resources.

IPC Classes  ?

  • G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]

82.

Dynamic Computational Resource Allocation

      
Application Number 17831978
Status Pending
Filing Date 2022-06-03
First Publication Date 2022-12-08
Owner Ab Initio Technology LLC (USA)
Inventor
  • Schmidt, Stephen J.
  • Ricardo, David
  • Stanfill, Craig W.
  • Chu, Jacky Cheuk Kei
  • Vishniac, Ephraim Meriwether

Abstract

A method for performing a distributed computation on a computing system using computational resources dynamically allocated using a computational resource manager includes storing information specifying quantities of computational resources associated with respective ones of a number of program portions of the program, where the program portions perform successive transformations of data and each program portion uses computational resources granted by the computational resource manager enabling computation associated with that program portion to be performed in the computing system, requesting a first quantity of computational resources associated with a first program portion of the number of program portions from the computational resource manager, receiving a second quantity of computational resources from the computational resource manager, less than the requested first quantity of computational resources, performing computation associated with the first portion of the program using the second quantity of computational resources, while performing the computation associated with the first portion of the program using the second quantity of computational resources, receiving an additional quantity of computational resources from the computational resource manager, and performing an additional computation associated with the first portion of the program using the additional quantity of computational resources while performing the computation associated with the first portion using the second quantity of computational resources.

IPC Classes  ?

  • G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]

83.

DYNAMIC COMPUTATIONAL RESOURCE ALLOCATION

      
Application Number US2022032139
Publication Number 2022/256638
Status In Force
Filing Date 2022-06-03
Publication Date 2022-12-08
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Schmidt, Stephen, J.
  • Ricardo, David
  • Stanfill, Craig, W.
  • Chu, Jacky, Cheuk Kei
  • Vishniac, Ephraim, Meriwether

Abstract

A method for performing a distributed computation on a computing system using computational resources dynamically allocated using a computational resource manager includes storing information specifying quantities of computational resources associated with respective ones of a number of program portions of the program, where the program portions perform successive transformations of data and each program portion uses computational resources granted by the computational resource manager enabling computation associated with that program portion to be performed in the computing system, requesting a first quantity of computational resources associated with a first program portion of the number of program portions from the computational resource manager, receiving a second quantity of computational resources from the computational resource manager, less than the requested first quantity of computational resources, performing computation associated with the first portion of the program using the second quantity of computational resources, while performing the computation associated with the first portion of the program using the second quantity of computational resources, receiving an additional quantity of computational resources from the computational resource manager, and performing an additional computation associated with the first portion of the program using the additional quantity of computational resources while performing the computation associated with the first portion using the second quantity of computational resources.

IPC Classes  ?

  • G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]

84.

SYSTEMS AND METHODS FOR DETERMINING RELATIONSHIPS AMONG DATA ELEMENTS

      
Application Number 17576572
Status Pending
Filing Date 2022-01-14
First Publication Date 2022-11-24
Owner Ab Initio Technology LLC (USA)
Inventor
  • Gould, Joel
  • Radivojevic, Dusan

Abstract

A data processing system configured to perform: obtaining a first data lineage representing relationships among physical data elements, the first data lineage being generated at least in part by performing at least one of: (a) analyzing source code of at least one computer program configured to access the physical data elements; and (b) analyzing information obtained during runtime of the at least one computer program; obtaining, based on user input, a second data lineage representing relationships among business data elements; obtaining an association between at least some of the physical data elements of the first data lineage and at least some of the business data elements of the second data lineage; and generating, based on the association between the physical data elements and the business data elements, an indication of agreement or discrepancy between the first data lineage and the second data lineage.

IPC Classes  ?

  • G06F 16/23 - Updating
  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
  • G06F 16/901 - IndexingData structures thereforStorage structures
  • G06F 11/36 - Prevention of errors by analysis, debugging or testing of software

85.

Processing data from multiple sources

      
Application Number 17878106
Grant Number 11720583
Status In Force
Filing Date 2022-08-01
First Publication Date 2022-11-17
Grant Date 2023-08-08
Owner Ab Initio Technology LLC (USA)
Inventor
  • Schechter, Ian
  • Wakeling, Tim
  • Wollrath, Ann M.

Abstract

In a first aspect, a method includes, at a node of a Hadoop cluster, the node storing a first portion of data in HDFS data storage, executing a first instance of a data processing engine capable of receiving data from a data source external to the Hadoop cluster, receiving a computer-executable program by the data processing engine, executing at least part of the program by the first instance of the data processing engine, receiving, by the data processing engine, a second portion of data from the external data source, storing the second portion of data other than in HDFS storage, and performing, by the data processing engine, a data processing operation identified by the program using at least the first portion of data and the second portion of data.

IPC Classes  ?

  • G06F 16/24 - Querying
  • G06F 16/2458 - Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
  • G06F 16/13 - File access structures, e.g. distributed indices
  • G06F 16/25 - Integrating or interfacing systems involving database management systems
  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
  • G06F 16/17 - Details of further file system functions
  • G06F 16/901 - IndexingData structures thereforStorage structures
  • G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]

86.

Transforming a specification into a persistent computer program

      
Application Number 17858605
Grant Number 12045286
Status In Force
Filing Date 2022-07-06
First Publication Date 2022-10-27
Grant Date 2024-07-23
Owner Ab Initio Technology LLC (USA)
Inventor
  • Egenolf, Jonah
  • Isman, Marshall A.
  • Wild, Frederic

Abstract

A method performed by a computer system including: accessing a specification that specifies a plurality of modules to be implemented by the computer program for processing the one or more values of the one or more fields in the structured data item; transforming the specification into the computer program that implements the plurality of modules, wherein the transforming includes: for each of one or more first modules of the plurality of modules: identifying one or more second modules of the plurality of modules that each receive input that is at least partly based on an output of the first module; and formatting an output data format of the first module such that the first module outputs only one or more values of one or more fields of the structured data item.

IPC Classes  ?

  • G06F 16/901 - IndexingData structures thereforStorage structures
  • G06F 8/10 - Requirements analysisSpecification techniques
  • G06F 8/34 - Graphical or visual programming
  • G06F 16/178 - Techniques for file synchronisation in file systems
  • G06F 16/25 - Integrating or interfacing systems involving database management systems
  • G06F 16/26 - Visual data miningBrowsing structured data

87.

GENERATION AND EXECUTION OF PROCESSING WORKFLOWS FOR CORRECTING DATA QUALITY ISSUES IN DATA SETS

      
Application Number US2022018310
Publication Number 2022/187224
Status In Force
Filing Date 2022-03-01
Publication Date 2022-09-09
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Martin, Jonathan
  • Weiss, Adam

Abstract

Systems and methods are for executing, by a data processing system, a workflow to process results data indicating an output of a data quality test on data records by generating, responsive to receiving the results data and metadata describing the results data, a data quality issue associated with a state and one or more processing steps of the workflow to resolve a data quality error associated with the data quality test. Operations include generating a workflow for processing results data based a state specified by a data quality issue. Generating the workflow includes: assigning, based on the results data and the state of the data quality issue, an entity responsible for resolving the data quality error; determining, based on the metadata, one or more actions for satisfying the data quality condition specified in the data quality test; and updating the state associated with the data quality issue.

IPC Classes  ?

  • G06F 11/07 - Responding to the occurrence of a fault, e.g. fault tolerance
  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors

88.

GENERATION AND EXECUTION OF PROCESSING WORKFLOWS FOR CORRECTING DATA QUALITY ISSUES IN DATA SETS

      
Document Number 03208255
Status Pending
Filing Date 2022-03-01
Open to Public Date 2022-09-09
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Martin, Jonathan
  • Weiss, Adam

Abstract

Systems and methods are for executing, by a data processing system, a workflow to process results data indicating an output of a data quality test on data records by generating, responsive to receiving the results data and metadata describing the results data, a data quality issue associated with a state and one or more processing steps of the workflow to resolve a data quality error associated with the data quality test. Operations include generating a workflow for processing results data based a state specified by a data quality issue. Generating the workflow includes: assigning, based on the results data and the state of the data quality issue, an entity responsible for resolving the data quality error; determining, based on the metadata, one or more actions for satisfying the data quality condition specified in the data quality test; and updating the state associated with the data quality issue.

IPC Classes  ?

  • G06F 11/07 - Responding to the occurrence of a fault, e.g. fault tolerance
  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors

89.

GENERATION AND EXECUTION OF PROCESSING WORKFLOWS FOR CORRECTING DATA QUALITY ISSUES IN DATA SETS

      
Application Number 17683984
Status Pending
Filing Date 2022-03-01
First Publication Date 2022-09-01
Owner Ab Initio Technology LLC (USA)
Inventor
  • Martin, Jonathan
  • Weiss, Adam

Abstract

Systems and methods are for executing, by a data processing system, a workflow to process results data indicating an output of a data quality test on data records by generating, responsive to receiving the results data and metadata describing the results data, a data quality issue associated with a state and one or more processing steps of the workflow to resolve a data quality error associated with the data quality test. Operations include generating a workflow for processing results data based a state specified by a data quality issue. Generating the workflow includes: assigning, based on the results data and the state of the data quality issue, an entity responsible for resolving the data quality error; determining, based on the metadata, one or more actions for satisfying the data quality condition specified in the data quality test; and updating the state associated with the data quality issue.

IPC Classes  ?

  • G06F 11/07 - Responding to the occurrence of a fault, e.g. fault tolerance
  • G06Q 10/06 - Resources, workflows, human or project managementEnterprise or organisation planningEnterprise or organisation modelling
  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors

90.

SYSTEMS AND METHODS FOR MANAGING PRIVILEGES IN A DATA PROCESSING SYSTEM

      
Application Number US2022017581
Publication Number 2022/182800
Status In Force
Filing Date 2022-02-23
Publication Date 2022-09-01
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Polstra, Drew
  • Parks, Robert

Abstract

Techniques for managing access privileges in a data processing system include obtaining a plurality of rules for granting and/or denying privileges to a first actor to perform at least one action on a first instance of a first data entity of data entities; identifying, from among attributes of the first data entity, a first attribute whose values are used by one or more of the plurality of rules; obtaining, from a user or from at least one data store, a first value of the first attribute; identifying, using the first value and from among the plurality of rules, a first rule that depends on the first value; generating a graphical user interface (GUI) including a visual rendering of at least some of the plurality of rules, the visual rendering emphasizing the first rule identified using the first value of the first attribute; and displaying the generated GUI to the user.

IPC Classes  ?

  • G06F 21/62 - Protecting access to data via a platform, e.g. using keys or access control rules
  • G06F 21/60 - Protecting data
  • H04L 9/40 - Network security protocols

91.

SYSTEMS AND METHODS FOR MANAGING PRIVILEGES IN A DATA PROCESSING SYSTEM

      
Document Number 03211810
Status Pending
Filing Date 2022-02-23
Open to Public Date 2022-09-01
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Polstra, Drew
  • Parks, Robert

Abstract

Techniques for managing access privileges in a data processing system include obtaining a plurality of rules for granting and/or denying privileges to a first actor to perform at least one action on a first instance of a first data entity of data entities; identifying, from among attributes of the first data entity, a first attribute whose values are used by one or more of the plurality of rules; obtaining, from a user or from at least one data store, a first value of the first attribute; identifying, using the first value and from among the plurality of rules, a first rule that depends on the first value; generating a graphical user interface (GUI) including a visual rendering of at least some of the plurality of rules, the visual rendering emphasizing the first rule identified using the first value of the first attribute; and displaying the generated GUI to the user.

IPC Classes  ?

  • G06F 21/60 - Protecting data
  • G06F 21/62 - Protecting access to data via a platform, e.g. using keys or access control rules
  • H04L 9/40 - Network security protocols

92.

DYNAMIC EXECUTION OF PARAMETERIZED APPLICATIONS FOR THE PROCESSING OF KEYED NETWORK DATA STREAMS

      
Application Number US2022016457
Publication Number 2022/177904
Status In Force
Filing Date 2022-02-15
Publication Date 2022-08-25
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Gould, Joel
  • Rossi, Larry Paul
  • Murphy, Trevor

Abstract

A method is described for processing keyed data items that are each associated with a value of a key, the keyed data items being from a plurality of distinct data streams, the processing including collecting the keyed data items, determining, based on contents of at least one of the keyed data items, satisfaction of one or more specified conditions for execution of one or more actions and causing execution of at least one of the one or more actions responsive to the determining.

IPC Classes  ?

  • G06F 9/448 - Execution paradigms, e.g. implementations of programming paradigms
  • G06F 16/901 - IndexingData structures thereforStorage structures
  • G06F 16/2455 - Query execution

93.

TECHNIQUES FOR MANAGING DATA IN A DATA PROCESSING SYSTEM USING DATA ENTITIES AND INHERITANCE

      
Document Number 03206016
Status Pending
Filing Date 2022-01-28
Open to Public Date 2022-08-04
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Gylfason, Halldor Isak
  • Radivojevic, Dusan
  • Parks, Robert
  • Weiss, Adam Harris

Abstract

Techniques for storing data entities by a data processing system are described herein. The data processing system may store a plurality of data entity instances generated using a plurality of data entities. The plurality of data entity instances may include a first data entity instance generated using a first data entity and a second data entity instance generated using a second data entity. The first data entity instance may include a first attribute that is configured to inherit its value from a second attribute of the second data entity instance. The data processing system may provide the inherited value of the second attribute of the second data entity instance as the value of the first attribute of the first data entity instance.

IPC Classes  ?

  • G06F 16/21 - Design, administration or maintenance of databases
  • G06F 16/22 - IndexingData structures thereforStorage structures

94.

DATA PROCESSING SYSTEM WITH MANIPULATION OF LOGICAL DATASET GROUPS

      
Document Number 03209125
Status Pending
Filing Date 2022-01-31
Open to Public Date 2022-08-04
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor Weisman, Amit

Abstract

A data processing system that receives user input specifying datasets on which operations are performed with user interfaces that enable manipulation of hierarchical groups of datasets. A user interface may enable individual datasets or a previously defined group of datasets to be aggregated into another grouping. The groupings may be scoped, including by persona of users, such that, when a user is prompted to specify one or more datasets as a target of an operation by the data processing system, the available choices are limited to datasets that have a scope encompassing that user. The interfaces may prompt a user to select a grouping within the hierarchy that contains datasets on which the operation can be performed. Upon selection of a grouping with multiple datasets as a target of an operation that is performed on datasets singly, the operation may be performed on each dataset in the selected group.

IPC Classes  ?

  • G06F 16/26 - Visual data miningBrowsing structured data

95.

Dataset multiplexer for data processing system

      
Application Number 17588965
Grant Number 12339829
Status In Force
Filing Date 2022-01-31
First Publication Date 2022-08-04
Grant Date 2025-06-24
Owner Ab Initio Technology LLC (USA)
Inventor
  • Weisman, Amit
  • Fantasia, Cory Christopher James
  • Becker, Matthew Douglas
  • Schechter, Ian Robert
  • Bach, Edward Alan
  • Parks, Robert

Abstract

A data processing system with a dataset multiplexer that enables applications to be written to specify access to datasets as operations on logical datasets. During execution of an application by the data processing system, operations that access a dataset are implemented by accessing an entry in a dataset catalog for the logical dataset. That entry includes information to access the physical data source storing the logical dataset, including conversion of data from the format of the physical data source to the format of the logical dataset. An entry in the catalog may be created based on registration of a data source with the dataset multiplexer and may be updated automatically based on changes in storage of the dataset. This maintenance of the catalog may be partially or totally automated such that the system automatically adjusts to any changes in storage of the dataset without need for modification of any application.

IPC Classes  ?

  • G06F 16/00 - Information retrievalDatabase structures thereforFile system structures therefor
  • G06F 16/21 - Design, administration or maintenance of databases
  • G06F 16/23 - Updating
  • G06F 16/25 - Integrating or interfacing systems involving database management systems

96.

Data processing system with manipulation of logical dataset groups

      
Application Number 17589016
Grant Number 11971909
Status In Force
Filing Date 2022-01-31
First Publication Date 2022-08-04
Grant Date 2024-04-30
Owner Ab Initio Technology LLC (USA)
Inventor Weisman, Amit

Abstract

A data processing system that receives user input specifying datasets on which operations are performed with user interfaces that enable manipulation of hierarchical groups of datasets. A user interface may enable individual datasets or a previously defined group of datasets to be aggregated into another grouping. The groupings may be scoped, including by persona of users, such that, when a user is prompted to specify one or more datasets as a target of an operation by the data processing system, the available choices are limited to datasets that have a scope encompassing that user. The interfaces may prompt a user to select a grouping within the hierarchy that contains datasets on which the operation can be performed. Upon selection of a grouping with multiple datasets as a target of an operation that is performed on datasets singly, the operation may be performed on each dataset in the selected group.

IPC Classes  ?

  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
  • G06F 16/248 - Presentation of query results
  • G06F 3/0482 - Interaction with lists of selectable items, e.g. menus

97.

TECHNIQUES FOR MANAGING DATA IN A DATA PROCESSING SYSTEM USING DATA ENTITIES AND INHERITANCE

      
Application Number US2022014232
Publication Number 2022/165123
Status In Force
Filing Date 2022-01-28
Publication Date 2022-08-04
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Gylfason, Halldor, Isak
  • Parks, Robert
  • Radivojevic, Dusan
  • Weiss, Adam, Harris

Abstract

Techniques for storing data entities by a data processing system are described herein. The data processing system may store a plurality of data entity instances generated using a plurality of data entities. The plurality of data entity instances may include a first data entity instance generated using a first data entity and a second data entity instance generated using a second data entity. The first data entity instance may include a first attribute that is configured to inherit its value from a second attribute of the second data entity instance. The data processing system may provide the inherited value of the second attribute of the second data entity instance as the value of the first attribute of the first data entity instance.

IPC Classes  ?

  • G06F 16/22 - IndexingData structures thereforStorage structures
  • G06F 16/21 - Design, administration or maintenance of databases

98.

SYSTEMS AND METHODS FOR ACCESSING DATA ENTITIES MANAGED BY A DATA PROCESSING SYSTEM

      
Application Number US2022014270
Publication Number 2022/165146
Status In Force
Filing Date 2022-01-28
Publication Date 2022-08-04
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Gylfason, Halldor, Isak
  • Parks, Robert

Abstract

Techniques for obtaining information about data entity instances managed by a data processing system using at least one data store. The techniques include obtaining a query comprising a first portion comprising information for identifying instances of a first data entity stored in at least one data store; and a second portion indicating at least one attribute of the first data entity; generating, from the query, a plurality of executable queries including a first set of one or more executable queries and a second set of one or more executable queries, the generating comprising: generating, using the first portion, the first set of executable queries for identifying instances of the first data entity, and generating, using the second portion, the second set of executable queries for obtaining attribute values for instances of the first data entity; and executing the plurality of executable queries to obtain results for the query.

IPC Classes  ?

  • G06F 16/25 - Integrating or interfacing systems involving database management systems

99.

DATASET MULTIPLEXER FOR DATA PROCESSING SYSTEM

      
Application Number US2022014547
Publication Number 2022/165326
Status In Force
Filing Date 2022-01-31
Publication Date 2022-08-04
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Weisman, Amit
  • Fantasia, Cory, Christopher James
  • Becker, Matthew, Douglas
  • Schechter, Ian, Robert
  • Bach, Edward, Alan
  • Parks, Robert

Abstract

A data processing system with a dataset multiplexer that enables applications to be written to specify access to datasets as operations on logical datasets. During execution of an application by the data processing system, operations that access a dataset are implemented by accessing an entry in a dataset catalog for the logical dataset. That entry includes information to access the physical data source storing the logical dataset, including conversion of data from the format of the physical data source to the format of the logical dataset. An entry in the catalog may be created based on registration of a data source with the dataset multiplexer and may be updated automatically based on changes in storage of the dataset. This maintenance of the catalog may be partially or totally automated such that the system automatically adjusts to any changes in storage of the dataset without need for modification of any application.

IPC Classes  ?

  • G06F 16/25 - Integrating or interfacing systems involving database management systems

100.

SYSTEMS AND METHODS FOR ACCESSING DATA ENTITIES MANAGED BY A DATA PROCESSING SYSTEM

      
Document Number 03206880
Status Pending
Filing Date 2022-01-28
Open to Public Date 2022-08-04
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Gylfason, Halldor Isak
  • Parks, Robert

Abstract

Techniques for obtaining information about data entity instances managed by a data processing system using at least one data store. The techniques include obtaining a query comprising a first portion comprising information for identifying instances of a first data entity stored in at least one data store; and a second portion indicating at least one attribute of the first data entity; generating, from the query, a plurality of executable queries including a first set of one or more executable queries and a second set of one or more executable queries, the generating comprising: generating, using the first portion, the first set of executable queries for identifying instances of the first data entity, and generating, using the second portion, the second set of executable queries for obtaining attribute values for instances of the first data entity; and executing the plurality of executable queries to obtain results for the query.

IPC Classes  ?

  • G06F 16/25 - Integrating or interfacing systems involving database management systems
  1     2     3     ...     8        Next Page