Techniques for processing a dataset comprising data stored in fields to identify field labels. The field labels describe data stored in the dataset fields. The techniques determine whether any field labels in a field label glossary match a field. If none of the field labels in the field label glossary match the field, the techniques generate a new field label using the name of the field. The generated field label may be assigned to the field.
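Below is a minimal sketch of the lookup-or-generate flow described above, in Python; the normalization step and glossary shape are illustrative assumptions, not the patented matching logic.

import re

def assign_field_label(field_name, glossary):
    """Return a glossary label matching the field, or generate one from its name."""
    normalized = re.sub(r"[_\s]+", " ", field_name).strip().lower()
    for label in glossary:
        if label.lower() == normalized:
            return label                    # a glossary label matches this field
    # No glossary label matched: generate a new label from the field name.
    return normalized.title()

print(assign_field_label("customer account id", ["Customer Account ID"]))  # matched
print(assign_field_label("cust_acct_id", ["Customer Account ID"]))         # generated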
A method for using a development environment to automatically generate code from a multi-tiered metadata model includes: receiving a specification to process a dataset, and, in response, accessing dataset characteristics and identifying controls received from a development environment to be applied to a field of the dataset in accordance with a metadata model by: accessing a first instance of a data structure that corresponds to the dataset; based on a reference in the first instance, accessing a second instance of a data structure associated with the field; based on a reference in the second instance, accessing a third instance of a data structure associated with metadata describing the field; and based on a reference in the third instance, accessing a fourth instance of a data structure storing a control defined based on the metadata. Based on the dataset characteristics, code is generated to apply the identified control to the field.
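As a rough illustration of the four-tier reference chain, here is a sketch with hypothetical dictionaries standing in for the data-structure instances.

datasets = {"ds1": {"field_ref": "f1"}}            # first tier: dataset
fields   = {"f1": {"metadata_ref": "m1"}}          # second tier: field
metadata = {"m1": {"control_ref": "c1"}}           # third tier: field metadata
controls = {"c1": {"check": "not_null"}}           # fourth tier: control

def control_for(dataset_id):
    ds = datasets[dataset_id]                 # first instance
    fld = fields[ds["field_ref"]]             # follow reference to the field
    md = metadata[fld["metadata_ref"]]        # follow reference to the metadata
    return controls[md["control_ref"]]        # follow reference to the control

print(control_for("ds1"))  # {'check': 'not_null'}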
An approach to allocation of referenced objects to memory resources addresses a situation in which there is a large number of memory resources, for example, 2^16 elements in the set of memory resources, and yet the number of objects referenced in a program specification exceeds even this number. The approach is applicable to compilation of a program specification for execution on a physical or virtual processor.
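A sketch of the general idea, under the assumption that slots are reused once an object's last use has passed; the policy shown is illustrative, not the patented allocator.

def allocate(object_last_use, pool_size):
    """object_last_use: list of (object_id, last_step_used), in first-use order."""
    free, assignment, releases = list(range(pool_size)), {}, []
    for step, (obj, last_use) in enumerate(object_last_use):
        while releases and releases[0][0] < step:      # reclaim expired slots
            free.append(releases.pop(0)[1])
        slot = free.pop(0)                             # grab a free resource
        assignment[obj] = slot
        releases.append((last_use, slot))
        releases.sort()
    return assignment

print(allocate([("a", 1), ("b", 3), ("c", 3)], pool_size=2))  # slot for "a" reused by "c"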
G06F 11/14 - Error detection or correction of the data by redundancy in operation, e.g. by using different operation sequences leading to the same result
8.
Partition-based Escrow in a Distributed Computing System
A method for fault-tolerant processing of a number of data elements using a distributed computing cluster. The distributed computing cluster includes a number of data processors associated with a corresponding number of data stores. The method includes storing the data elements in the distributed computing cluster, wherein the data elements are distributed across the data stores according to a number of partitions of data elements, processing data elements of a first set of partitions stored at a first data store using a first data processor to generate first result data for the data elements of the first set of partitions, sending the first result data from the distributed computing cluster to a consumer of the first result data outside the distributed computing cluster, and storing the first result data in a first buffer located in the distributed computing cluster and associated with the first data processor until the consumer has persistently stored the first result data outside the distributed computing cluster.
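The escrow behavior can be pictured with the following sketch (class and method names are hypothetical): results remain buffered until the consumer acknowledges durable storage, so they can be replayed after a failure.

class EscrowBuffer:
    def __init__(self):
        self.pending = {}                      # result_id -> result data

    def send(self, result_id, data, consumer):
        self.pending[result_id] = data         # keep a copy until acknowledged
        consumer.deliver(result_id, data)

    def acknowledge(self, result_id):
        self.pending.pop(result_id, None)      # safe to drop once persisted

    def replay(self, consumer):
        for result_id, data in self.pending.items():
            consumer.deliver(result_id, data)  # re-send after a consumer failure

class PrintConsumer:
    def deliver(self, rid, data):
        print("got", rid, data)

buf = EscrowBuffer()
buf.send(1, {"count": 42}, PrintConsumer())
buf.acknowledge(1)                             # consumer has persisted result 1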
A data processing system with a dataset multiplexer that enables applications to be written to specify access to datasets as operations on logical datasets. During execution of an application by the data processing system, the physical dataset used for performing data access operations may be selected based on current context. Current context may be specified based on values of system parameters and/or user specified values. The physical dataset accessed may be identified by selecting a record from multiple records in a dataset catalog associated with the logical dataset. Each record includes information to access a physical dataset associated with the selected record and context information to indicate the context in which the specific physical dataset is to be selected.
G06F 16/908 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually; using metadata automatically derived from the content
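A minimal sketch of the context-based resolution from a catalog described in the multiplexer abstract above; the catalog layout and context keys are assumptions.

catalog = {
    "customers": [   # logical dataset -> candidate physical datasets
        {"path": "/prod/customers.dat", "context": {"env": "prod"}},
        {"path": "/test/customers.dat", "context": {"env": "test"}},
    ]
}

def resolve(logical_name, current_context):
    for record in catalog[logical_name]:
        if all(current_context.get(k) == v for k, v in record["context"].items()):
            return record["path"]
    raise LookupError(f"no physical dataset for {logical_name} in {current_context}")

print(resolve("customers", {"env": "test", "user": "alice"}))  # /test/customers.dat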
Some embodiments relate to generating a list of data fields referenceable at a point in a graph (there are different lists for each point). This list may be used as part of programming a dataflow graph to select data (e.g., at an input node of a component to select data processed in that component). One aspect relates to display of the list of data fields, because some of the data field names may be overloaded. Accordingly, the data fields may be presented hierarchically if necessary, showing the source for each overloaded data field name. Otherwise, the user may select whether the list of referenceable fields is grouped by source.
A computer-implemented method for defining a test for a computer program includes receiving operational data generated during execution of a computer program in a first computing environment, the operational data indicative of (i) a data source accessed by the computer program during execution of the computer program and (ii) a destination to which baseline data records are output by the computer program during execution of the computer program. Based on the received operational data, a data storage object is generated that includes (i) input data records from the data source and the baseline data records from the destination, and (ii) test definition data for the first computing environment. Responsive to migration of the computer program to a second computing environment, the input and baseline data records from the data storage object are stored in the second computing environment. A test configuration is defined for the migrated computer program in the second computing environment according to the test definition data in the data storage object and a mapping between the first computing environment and the second computing environment, the test configuration for the migrated computer program identifying a location of the input data records and a location of the baseline data records in the second computing environment. Execution of the migrated computer program in the second computing environment is tested using the input data records and baseline data records in the second computing environment and according to the defined test configuration for the migrated computer program.
The present disclosure relates to a computer-implemented method for conversion of a first data lineage to a second data lineage, the method comprising: obtaining a first data lineage specifying relationships among physical components of a plurality of physical components; receiving an identification of a portion of the first data lineage; generating a second data lineage from the identified portion of the first data lineage, the second data lineage specifying relationships among second components of a plurality of second components, wherein the second components of the plurality of second components are associated with at least some of the physical components of the identified portion of the first data lineage. A corresponding computer-readable medium, a corresponding data processing system, and a corresponding computer program are also described.
The present disclosure relates to a computer-implemented method, the method comprising: obtaining a data lineage whose structure specifies relationships among data sets of a plurality of data sets; analyzing the structure of the data lineage; based on a result of the analyzing, identifying a subset of the plurality of data sets for which a parameter is to be evaluated, wherein the subset includes one or more of the data sets, and wherein the parameter is for indicating a potential error within a data set; and outputting an indication of the identified subset of one or more data sets. A computer-readable medium, computer program, a corresponding data processing apparatus, and a data structure are described as well.
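One way to picture the structural analysis is the sketch below, which flags datasets with many upstream inputs; the fan-in heuristic is an illustrative assumption, not the disclosed analysis.

lineage = {  # dataset -> upstream datasets it is derived from
    "report": ["orders", "customers", "returns"],
    "orders": ["raw_orders"],
    "customers": [],
    "returns": [],
    "raw_orders": [],
}

def datasets_to_check(lineage, min_inputs=2):
    return [ds for ds, inputs in lineage.items() if len(inputs) >= min_inputs]

print(datasets_to_check(lineage))  # ['report'] -- many inputs, more chances for error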
Techniques for discovering primary, unique, and/or foreign keys for relational datasets are described. The techniques include profiling the relational datasets to obtain respective data profiles; identifying one or more primary key candidates for a first relational dataset using a first data profile of the first relational dataset and a first trained machine learning model; identifying one or more foreign key proposals for a second relational dataset using the one or more primary key candidates by performing a subset analysis of the second relational dataset with respect to the first relational dataset; identifying one or more foreign key candidates for the second relational dataset using the first data profile, a second data profile of the second relational dataset, and a second trained machine learning model different from the first trained machine learning model; and outputting the primary key candidate(s) and the foreign key candidate(s).
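The subset analysis at the core of the foreign-key step can be sketched as follows (toy tables, no machine-learning models): column B is a foreign-key candidate for primary key A if B's values are a subset of A's.

orders    = {"customer_id": [1, 2, 2, 3]}
customers = {"id": [1, 2, 3, 4]}

def foreign_key_proposals(child, parent, parent_pk):
    pk_values = set(parent[parent_pk])
    return [col for col, values in child.items() if set(values) <= pk_values]

print(foreign_key_proposals(orders, customers, "id"))  # ['customer_id']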
Techniques for using finite state machines (FSMs) to implement workflows in a data processing system comprising at least one data store storing data objects and a workflow management system (WMS). The WMS is configured to perform: determining a current value of an attribute of a first data object by accessing the current value in the at least one data store; identifying, using the current value and metadata specifying relationships among at least some of the data objects, an actor authorized to perform a workflow task for the first data object; generating a GUI through which the actor can provide input specifying that the workflow task is to be performed; and in response to receiving, from the actor and through the GUI, input specifying that the workflow task is to be performed: performing the workflow task; and updating the current workflow state of the first FSM to a second workflow state.
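A compact sketch of the FSM transition plus authorization check; states, tasks, and the authorization table are hypothetical.

transitions = {("draft", "approve"): "approved", ("approved", "publish"): "published"}

def perform_task(state, task, actor, authorized_actors):
    if actor not in authorized_actors.get(task, ()):
        raise PermissionError(f"{actor} may not {task}")
    return transitions[(state, task)]          # advance to the next workflow state

state = perform_task("draft", "approve", "reviewer", {"approve": {"reviewer"}})
print(state)  # approved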
A method for developing a reusable data processing program including a set of data transformation steps by displaying a set of records and iteratively enabling a user to select one or more data transformation steps, iteratively applying the data transformation steps to the records, and iteratively displaying the transformed records.
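The iterative loop can be pictured as below; the two transformation steps are stand-ins for user selections.

records = [{"name": " ada "}, {"name": "grace"}]
steps = [
    lambda r: {**r, "name": r["name"].strip()},   # step 1: trim whitespace
    lambda r: {**r, "name": r["name"].title()},   # step 2: normalize casing
]

for step in steps:                    # each iteration applies one chosen step...
    records = [step(r) for r in records]
    print(records)                    # ...and redisplays the transformed records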
G06F 16/25 - Integrating or interfacing systems involving database management systems
G06F 16/27 - Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
24.
Migration of datasets among federated database systems
In an aspect, a method for migrating data records to a federated database system includes obtaining data records from a data source in a first federated database system; generating a data snapshot file based on the obtained data records and data indicative of a characteristic associated with the obtained data records; generating a hash of the data snapshot file to prevent modification of the data snapshot file; storing the data snapshot file and the generated hash in a data storage; migrating the obtained data records from the data snapshot file to a data target in a second federated database system, the migrating including: retrieving the data records from the data snapshot file stored in the data storage; providing the retrieved data records to the data target according to a mapping between a characteristic of the data source and a characteristic of the data target.
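A sketch of the snapshot-plus-hash step using SHA-256 via Python's hashlib; the JSON serialization is an assumption for illustration.

import hashlib, json

def make_snapshot(records, characteristics):
    snapshot = json.dumps({"records": records, "meta": characteristics},
                          sort_keys=True).encode()
    return snapshot, hashlib.sha256(snapshot).hexdigest()

def verify_snapshot(snapshot, expected_hash):
    return hashlib.sha256(snapshot).hexdigest() == expected_hash

snap, digest = make_snapshot([{"id": 1}], {"source": "dbA"})
print(verify_snapshot(snap, digest))  # True -- safe to migrate to the target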
A method implemented by a data processing system for enabling a system to pipeline or otherwise process data in conformance with specified criteria by providing a graphical user interface for selecting data to be processed, determining metadata of selected data, and, based on the metadata, automatically processing the selected data in conformance with the specified criteria.
G06F 16/20 - Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
G06F 3/048 - Interaction techniques based on graphical user interfaces [GUI]
G06F 9/448 - Execution paradigms, e.g. implementations of programming paradigms
G06F 9/451 - Execution arrangements for user interfaces
G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
G06F 16/908 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually; using metadata automatically derived from the content
Techniques for obtaining information about data entity instances managed by a data processing system using at least one data store. The techniques include obtaining a query comprising a first portion comprising information for identifying instances of a first data entity stored in at least one data store; and a second portion indicating at least one attribute of the first data entity; generating, from the query, a plurality of executable queries including a first set of one or more executable queries and a second set of one or more executable queries, the generating comprising: generating, using the first portion, the first set of executable queries for identifying instances of the first data entity, and generating, using the second portion, the second set of executable queries for obtaining attribute values for instances of the first data entity; and executing the plurality of executable queries to obtain results for the query.
A method implemented by a data processing system for enabling a user to browse a data catalog and select fields of datasets from multiple data sources to be integrated into a data profile so that, when a request is received for the data profile, data from those fields can be made available efficiently and immediately.
A method implemented by a data processing system for: enabling a user to preview attributes of fields of an expanded view of a base dataset and to specify one or more of the fields to use in downstream data processing and generating a dataset that includes the one or more of the fields from the preview specified to be used in the downstream data processing, with the generated dataset having increased efficiency with respect to speed and data memory, relative to an efficiency of generating a dataset including all the fields of the expanded view when only the specified one or more of the fields are used in the downstream data processing.
Described are techniques for causing a data processing system to perform real-time decisioning by generating a data record (e.g., a dynamic data record) based on a request for the real-time decisioning, wherein the data record includes batch data and real-time data retrieved from one or more operational systems responsive to receipt of the request, with real-time being with regard to when the request is received by the data processing system.
At least one non-transitory computer-readable storage medium storing processor-executable instructions that, when executed by at least one computer hardware processor, cause the at least one computer hardware processor to perform: obtaining an automatically generated initial dataflow graph, the initial dataflow graph comprising a first plurality of nodes representing a first plurality of data processing operations and a first plurality of links representing flows of data among nodes in the first plurality of nodes; and generating an updated dataflow graph by iteratively applying dataflow graph optimization rules to update the initial dataflow graph, the updated dataflow graph comprising a second plurality of nodes representing a second plurality of data processing operations and a second plurality of links representing flows of data among nodes in the second plurality of nodes.
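The iterate-until-fixed-point structure can be sketched as below; the single rewrite rule shown (dropping a redundant adjacent sort) is illustrative, not the patent's rule set.

def drop_repeated_ops(nodes):
    out = []
    for op in nodes:
        if out and out[-1] == op == "sort":    # a second sort of sorted data is a no-op
            continue
        out.append(op)
    return out

graph = ["read", "sort", "sort", "filter"]
while True:
    updated = drop_repeated_ops(graph)
    if updated == graph:                       # fixed point: no rule applies
        break
    graph = updated
print(graph)  # ['read', 'sort', 'filter']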
Described are techniques for back-calculating one or more values of a new, real-time aggregate before sufficient data to calculate the new, real-time aggregate has been collected, wherein the back-calculating is based on data collected for one or more aggregates that have been executing prior to start of execution of the new, real-time aggregate.
A method for performing real-time segmentation by updating a wide record based on receipt of real-time data, wherein an item of real-time data represents a transaction, detecting that the updated wide record satisfies criteria for performing real-time segmentation, and performing real-time segmentation on the updated, wide record, wherein real-time is relative to when a transaction represented in the updated wide record occurs.
A method for enabling a user to generate a complex aggregation on their own by providing the user with a graphical user interface that displays data items in a data catalog and that provides controls for the user to select data items to be used in generating the complex aggregation, and to select a type of aggregation, and based on the user's selections, automatically generating computer instructions to generate a value of the complex aggregation is described.
A data processing system that receives user input specifying datasets on which operations are performed with user interfaces that enable manipulation of hierarchical groups of datasets. A user interface may enable individual datasets or a previously defined group of datasets to be aggregated into another grouping. The groupings may be scoped, including by persona of users, such that, when a user is prompted to specify one or more datasets as a target of an operation by the data processing system, the available choices are limited to datasets that have a scope encompassing that user. The interfaces may prompt a user to select a grouping within the hierarchy that contains datasets on which the operation can be performed. Upon selection of a grouping with multiple datasets as a target of an operation that is performed on datasets singly, the operation may be performed on each dataset in the selected group.
Techniques for managing access privileges in a data processing system include obtaining a plurality of rules for granting and/or denying privileges to a first actor to perform at least one action on a first instance of a first data entity of data entities; identifying, from among attributes of the first data entity, a first attribute whose values are used by one or more of the plurality of rules; obtaining, from a user or from at least one data store, a first value of the first attribute; identifying, using the first value and from among the plurality of rules, a first rule that depends on the first value; generating a graphical user interface (GUI) including a visual rendering of at least some of the plurality of rules, the visual rendering emphasizing the first rule identified using the first value of the first attribute; and displaying the generated GUI to the user.
H04L 41/22 - Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks comprising specially adapted graphical user interfaces [GUI]
48.
Generating rules for data processing values of data fields from semantic labels of the data fields
Methods and systems are configured to determine a semantic meaning for data and generate data processing rules based on the semantic meaning of the data. The semantic meaning includes syntactical or contextual meaning for the data that is determined, for example, by profiling, by the data processing system, values stored in a field included in data records of one or more datasets; applying, by the data processing system, one or more classifiers to the profiled values; identifying, based on applying the one or more classifiers, one or more attributes indicative of a logical or syntactical characteristic for the values of the field, with each of the one or more attributes having a respective confidence level that is based on an output of each of the one or more classifiers. The attributes are associated with the fields and are used for generating data processing rules and processing the data.
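A toy sketch of the profile-then-classify flow; the two pattern-based classifiers below stand in for the system's actual classifiers.

import re

def profile(values):
    return {"values": values, "max_len": max(map(len, values))}  # tiny stand-in profile

def date_classifier(p):
    hits = sum(bool(re.fullmatch(r"\d{4}-\d{2}-\d{2}", v)) for v in p["values"])
    return "date", hits / len(p["values"])

def zip_classifier(p):
    hits = sum(bool(re.fullmatch(r"\d{5}", v)) for v in p["values"])
    return "zip_code", hits / len(p["values"])

p = profile(["2021-04-01", "2021-05-13", "not a date"])
print([clf(p) for clf in (date_classifier, zip_classifier)])
# [('date', 0.666...), ('zip_code', 0.0)] -> each attribute with its confidence level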
A method implemented by a data processing system including: accessing a container image that includes a first application and a second application; determining, by the data processing system, a number of parallel executions of a given module of the first application; for the given module, generating a plurality of instances of the container image in accordance with the number of parallel executions determined; for each instance, configuring that instance to execute the given module of the first application; causing each of the plurality of configured instances to execute on one or more host systems; and for at least one of the plurality of configured instances, causing, by the second application of that configured instance, communication between the data processing system and the one or more host systems executing that configured instance.
A method includes accessing a schema that specifies relationships among datasets, computations on the datasets, or transformations of the datasets, selecting a dataset from among the datasets, and identifying, from the schema, other datasets that are related to the selected dataset. Attributes of the datasets are identified, and logical data representing the identified attributes and relationships among the attributes is generated. The logical data is provided to a development environment, which provides access to portions of the logical data representing the identified attributes. A specification that specifies at least one of the identified attributes in performing an operation is received from the development environment. Based on the specification and the relationships among the identified attributes represented by the logical data, a computer program is generated to perform the operation by accessing, from storage, at least one dataset having the at least one of the attributes specified in the specification.
A method for generating an executable application to transform and load data into a structured dataset includes receiving a metadata file that specifies values for parameters for structuring data feeds, received from a networked data source, into a structured database. The metadata file specifies logical rules for transforming the data feeds. The values of the parameters and the logical rules for transforming the plurality of the data feeds are validated to ensure logical consistency for each data feed. Data rules are generated that specify standards for transforming each data feed in accordance with the validated values of the parameters and logical rules. The executable application is generated that is configured to receive source data comprising a data feed from one or more data sources and transform the source data into structured data that satisfies the one or more standards for the structured data record in compliance with the data rules.
Techniques for discovering semantic meaning of data in fields included in one or more data sets, the method including: identifying a first field having a previously-assigned label that indicates a semantic meaning of the first field; identifying a set of one or more candidate labels for potential assignment to the first field instead of the previously-assigned label; evaluating, using a previously-determined label score and a first candidate label score, whether to assign a first candidate label to the first field, the evaluating comprising: when the first candidate label score is at least a first threshold amount greater than a previously-determined label score, presenting the first candidate label to a user by generating an interface through which the user can provide input indicating whether to assign the first candidate label to the first field instead of the previously-determined label.
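The evaluation step reduces to a threshold comparison, sketched below with an assumed margin.

def should_propose(current_score, candidate_score, margin=0.15):
    return candidate_score >= current_score + margin

print(should_propose(0.60, 0.80))  # True  -> ask the user about the new label
print(should_propose(0.60, 0.65))  # False -> keep the previously assigned label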
Some embodiments provide techniques of enforcing valid data assignments in a data processing system in which data can be dynamically updated by user devices and/or computerized processes. The techniques identify, using a validation rule associated with a data entity, one or more valid values for assignment to an attribute of an instance of the data entity. The techniques identify the valid value(s) by generating a query for the one or more valid values using one or more condition(s) on the attribute in the validation rule, and executing the generated query to obtain the one or more valid values for the first attribute. The attribute may then be assigned one or more of the identified valid value(s).
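A sketch of deriving the valid-values query from a rule's conditions; the SQL text is illustrative rather than the system's query language.

def valid_values_query(table, attribute, conditions):
    where = " AND ".join(conditions)
    return f"SELECT DISTINCT {attribute} FROM {table} WHERE {where}"

q = valid_values_query("departments", "dept_code",
                       ["active = 1", "region = 'EU'"])
print(q)  # executed, its result set is the only set of assignable values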
Among other things, we describe a method of receiving a portion of metadata from a data source, the portion of metadata describing nodes and edges; generating instances of a data structure representing the portion of metadata, at least one instance of the data structure including an identification value that identifies a corresponding node, one or more property values representing respective properties of the corresponding node, and one or more pointers to respective identification values, each pointer representing an edge associated with a node identified by the corresponding respective identification value; storing the instances of the data structure in random access memory; receiving a query that includes an identification of at least one particular element of data; and using at least one instance of the data structure to cause a display of a computer system to display a representation of lineage of the particular element of data.
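The node structure described (identification value, properties, pointers to other nodes by id) might look like the following sketch, with a recursive lineage walk; all names are hypothetical.

from dataclasses import dataclass, field

@dataclass
class Node:
    node_id: str
    properties: dict = field(default_factory=dict)
    upstream: list = field(default_factory=list)   # ids of nodes feeding this one

nodes = {
    "raw":    Node("raw", {"type": "file"}),
    "clean":  Node("clean", {"type": "table"}, upstream=["raw"]),
    "report": Node("report", {"type": "view"}, upstream=["clean"]),
}

def lineage_of(node_id):
    node = nodes[node_id]
    for parent in node.upstream:
        yield from lineage_of(parent)
    yield node_id

print(list(lineage_of("report")))  # ['raw', 'clean', 'report']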
A method for using a metadata model to perform operations on data items, with the metadata model including parent nodes and child nodes connected by edges, with the parent nodes specifying logical metadata and the child nodes specifying physical metadata representing the data items, and with the edges specifying relationships between the nodes. The method includes: identifying a given data item and physical metadata of that given data item, accessing the metadata model, identifying, in the metadata model, a child node representing the physical metadata of the given data item, traversing one or more edges in the metadata model to identify parent nodes of the child node, determining, from logical metadata associated with the identified parent nodes, one or more operations to be performed on the given data item, applying the one or more operations to the given data item to transform the data item, and storing the transformed data item.
Techniques for generating a dataflow graph include generating a first dataflow graph with a plurality of first nodes representing first computer operations in processing data, with at least one of the first computer operations being a declarative operation that specifies one or more characteristics of one or more results of processing of data, and transforming the first dataflow graph into a second dataflow graph for processing data in accordance with the first computer operations, the second dataflow graph including a plurality of second nodes representing second computer operations, with at least one of the second nodes representing one or more imperative operations that implement the logic specified by the declarative operation, where the one or more imperative operations are unrepresented by the first nodes in the first dataflow graph.
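The declarative-to-imperative lowering can be pictured as a node-by-node expansion; the unique-by example mapping is an assumption, purely for illustration.

def lower(node):
    if node == ("unique_by", "customer_id"):
        # the declarative node only states the desired result; lowering supplies
        # imperative operations that implement it
        return [("sort", "customer_id"), ("dedup_sorted", "customer_id")]
    return [node]

first_graph = [("read", "orders"), ("unique_by", "customer_id")]
second_graph = [op for node in first_graph for op in lower(node)]
print(second_graph)
# [('read', 'orders'), ('sort', 'customer_id'), ('dedup_sorted', 'customer_id')]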
A data processing system for discovering a semantic meaning of a field included in one or more data sets is configured to identify a field included in one or more data sets, with the field having an identifier. For that field, the system profiles data values of the field to generate a data profile, accesses a plurality of label proposal tests, and generates a set of label proposals by applying the plurality of label proposal tests to the data profile. The system determines a similarity among the label proposals and selects a classification. The system identifies one of the label proposals as identifying the semantic meaning. The system stores the identifier of the field with the identified one of the label proposals that identifies the semantic meaning.
G06F 16/908 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually; using metadata automatically derived from the content
Described herein are techniques, performed by a data processing system, for enabling efficient development of software application programs in a dynamic environment with multiple datasets by generating entries in a dataset catalog to provide a software application program with access to output data dynamically generated by dataflow graphs, the entries associated with respective software application programs developed as dataflow graphs. The techniques include identifying a subgraph, wherein, when the subgraph is executed, the subgraph generates output data by applying one or more data processing operations to data obtained from one or more data sources; creating, in the dataset catalog, a new entry associated with the identified subgraph, the new entry associated with information indicating nodes, links, and configuration parameters of the identified subgraph; and configuring the dataset catalog to enable access to the new entry, in the dataset catalog, associated with the identified subgraph.
A method is described for processing keyed data items that are each associated with a value of a key, the keyed data items being from a plurality of distinct data streams, the processing including collecting the keyed data items, determining, based on contents of at least one of the keyed data items, satisfaction of one or more specified conditions for execution of one or more actions and causing execution of at least one of the one or more actions responsive to the determining.
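A sketch of collecting keyed items across distinct streams and firing an action when a condition over their contents is met; stream names and the mismatch condition are hypothetical.

from collections import defaultdict

collected = defaultdict(dict)   # key -> {stream_name: item}

def on_item(key, stream, item):
    collected[key][stream] = item
    entry = collected[key]
    # condition: both streams seen for this key and their amounts disagree
    if ({"ledger", "gateway"} <= entry.keys()
            and entry["ledger"]["amount"] != entry["gateway"]["amount"]):
        print(f"action: flag mismatch for {key}")

on_item("txn-7", "ledger", {"amount": 100})
on_item("txn-7", "gateway", {"amount": 95})    # triggers the action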
Characterizing data includes: reading data from an interface to a data storage system, and storing two or more sets of summary data summarizing data stored in different respective data sources in the data storage system; and processing the stored sets of summary data to generate system information characterizing data from multiple data sources in the data storage system. The processing includes: analyzing the stored sets of summary data to select two or more data sources that store data satisfying predetermined criteria, and generating the system information including information identifying a potential relationship between fields of records included in different data sources based at least in part on comparison between values from a stored set of summary data summarizing a first of the selected data sources and values from a stored set of summary data summarizing a second of the selected data sources.
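The cross-source comparison can be sketched with distinct-value sets standing in for the stored summary data; the overlap threshold is an assumption.

summary_a = {"customer_id": {1, 2, 3, 4}}
summary_b = {"cust_ref": {2, 3, 4}, "amount": {10, 95, 100}}

def related_fields(sa, sb, min_overlap=0.8):
    pairs = []
    for fa, va in sa.items():
        for fb, vb in sb.items():
            overlap = len(va & vb) / min(len(va), len(vb))
            if overlap >= min_overlap:          # enough shared values to suggest a link
                pairs.append((fa, fb, overlap))
    return pairs

print(related_fields(summary_a, summary_b))  # [('customer_id', 'cust_ref', 1.0)]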
A method for updating a computer program includes receiving a computer program hosted on and configured to be executed by a first computing system. The method includes analyzing the computer program to obtain characterization of a lineage, an architecture, and an operation of the computer program. The lineage includes relationships among elements of the computer program, the architecture includes a characteristic of the data source, the data target, and one or more processors configured to process the data contained in data records, and the operation includes processes that are executed to process the data from the data records. The method includes receiving a characterization of an update to be made to the computer program, in which when the computer program is modified according to the update, at least some of the modified computer program is configured to be hosted on and executed by a second computing system; and modifying the computer program to implement the update to generate the modified computer program.
G06F 16/27 - Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
H04L 67/06 - Protocols specially adapted for file transfer, e.g. file transfer protocol [FTP]
H04L 67/00 - Network arrangements or protocols for supporting network services or applications
72.
SYSTEMS AND METHODS FOR PERFORMING DATA PROCESSING OPERATIONS USING VARIABLE LEVEL PARALLELISM
Techniques for determining processing layouts for nodes of a dataflow graph. The techniques include: obtaining information specifying a dataflow graph, the dataflow graph comprising a plurality of nodes and a plurality of edges connecting the plurality of nodes, the plurality of edges representing flows of data among nodes in the plurality of nodes, the plurality of nodes comprising: a first set of one or more nodes; and a second set of one or more nodes disjoint from the first set of nodes; obtaining a first set of one or more processing layouts for the first set of nodes; and determining a processing layout for each node in the second set of nodes based on the first set of processing layouts and one or more layout determination rules, the one or more layout determination rules including at least one rule for selecting among processing layouts having different degrees of parallelism, and information indicating that data generated by at least one node in the first and/or second set of nodes is not used by any nodes in the dataflow graph downstream from the at least one node.
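A sketch of layout propagation under one possible rule, preferring the wider degree of parallelism among a node's inputs; the rule and graph are illustrative assumptions.

fixed = {"read": 1, "big_join": 8}            # first set: layouts already known
edges = {"filter": ["read"], "rollup": ["filter", "big_join"]}
memo = dict(fixed)

def layout_of(node):
    if node not in memo:
        # rule: among inputs with different degrees of parallelism, take the wider one
        memo[node] = max(layout_of(p) for p in edges[node])
    return memo[node]

print(layout_of("rollup"))  # 8 -- inherits the higher degree of parallelism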
An electronic system for increasing the speed of preparing data with a specified data quality for storage by automatically identifying for a user, with minimal user input, common contexts among (i) fields in disparate datasets, and (ii) names the user has specified as potentially describing the fields, and by using those common contexts to govern the disparate datasets prior to storage to ensure the specified data quality.
A method includes automatically determining a component of a security label for each first record in a first table of a database having multiple tables, including: identifying a second record related to the first record according to a foreign key relationship; identifying a component of the security label for the second record; and assigning a value for the component of the security label for the first record based on the identified component of the security label for the second record. The method includes storing the determined security label in the record.
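A sketch of the foreign-key propagation step with toy tables; table and label names are hypothetical.

accounts = {"a1": {"security_label": "confidential"}}
payments = [{"id": "p1", "account_id": "a1"}]

for record in payments:                        # each first record in the first table
    parent = accounts[record["account_id"]]    # identify the related second record
    record["security_label"] = parent["security_label"]  # assign its label component

print(payments[0])  # inherits 'confidential' from the related account record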
Some embodiments relate to a method for use in connection with governance of a plurality of data assets managed by a data processing system, the method comprising: using at least one computer hardware processor to perform: accessing a data governance policy comprising a first data standard (e.g., by obtaining information about the first standard stored in a database system); generating a first data asset collection at least in part by automatically selecting, from among the plurality of data assets managed by the data processing system and using at least one data asset criterion, one or more data assets that meet the at least one data asset criterion; associating the first data asset collection with the first data standard; and verifying whether at least one of the one or more data assets in the first data asset collection complies with the first data standard.
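The selection-then-verification flow could be sketched as below; the assets, the criterion, and the standard are all invented stand-ins for whatever a real governance policy would define.

assets = [
    {"name": "customers.csv", "domain": "finance", "has_owner": True},
    {"name": "clicks.log",    "domain": "finance", "has_owner": False},
    {"name": "staff.parquet", "domain": "hr",      "has_owner": True},
]

criterion = lambda a: a["domain"] == "finance"   # data asset criterion
standard  = lambda a: a["has_owner"]             # first data standard: every asset has an owner

collection = [a for a in assets if criterion(a)]                 # generate the collection
violations = [a["name"] for a in collection if not standard(a)]  # verify compliance

print("in collection:", [a["name"] for a in collection])
print("non-compliant:", violations)   # ['clicks.log']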
A method for performing a distributed computation on a computing system, using computational resources dynamically allocated by a computational resource manager, includes storing information specifying quantities of computational resources associated with respective program portions of a program, where the program portions perform successive transformations of data and each portion uses computational resources granted by the resource manager to perform its computation in the computing system. A first quantity of computational resources associated with a first program portion is requested from the resource manager, and a second quantity, less than the requested first quantity, is received. Computation associated with the first program portion is performed using the second quantity of computational resources. While that computation is in progress, an additional quantity of computational resources is received from the resource manager, and additional computation associated with the first program portion is performed using the additional quantity while the computation using the second quantity continues.
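A toy, purely sequential sketch of the grant-widening behavior follows; the quantities, the run_portion helper, and the "wave" model of parallel width are all invented, and a real system would run the waves concurrently rather than annotate them.

def run_portion(work_items, slots):
    """Process items in 'waves' as wide as the currently granted slot count."""
    done = []
    while work_items:
        wave, work_items = work_items[:slots["granted"]], work_items[slots["granted"]:]
        done.extend(f"{w} (width {slots['granted']})" for w in wave)
    return done

slots = {"granted": 3}                   # manager grants 3 of the 8 requested
items = [f"partition-{i}" for i in range(10)]

out = []
out.extend(run_portion(items[:6], slots))   # early waves run at the partial width
slots["granted"] += 2                       # additional grant arrives mid-computation
out.extend(run_portion(items[6:], slots))   # later work uses the widened grant
print(out)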
A data processing system configured to perform: obtaining a first data lineage representing relationships among physical data elements, the first data lineage being generated at least in part by performing at least one of: (a) analyzing source code of at least one computer program configured to access the physical data elements; and (b) analyzing information obtained during runtime of the at least one computer program; obtaining, based on user input, a second data lineage representing relationships among business data elements; obtaining an association between at least some of the physical data elements of the first data lineage and at least some of the business data elements of the second data lineage; and generating, based on the association between the physical data elements and the business data elements, an indication of agreement or discrepancy between the first data lineage and the second data lineage.
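The reconciliation step might look like the sketch below, where the physical lineage, the business lineage, and the mapping between their elements are invented examples: physical edges are translated through the association and then compared edge-by-edge with the business lineage.

physical_edges = {("src.customers", "dw.cust_dim")}        # from code/runtime analysis
business_edges = {("Customer Intake", "Customer Master"),  # from user input
                  ("Orders Feed", "Order Book")}

to_business = {"src.customers": "Customer Intake",         # association between elements
               "dw.cust_dim": "Customer Master"}

mapped = {(to_business[a], to_business[b]) for a, b in physical_edges
          if a in to_business and b in to_business}

print("agreement:", mapped & business_edges)     # edges present in both lineages
print("discrepancy:", business_edges - mapped)   # business edges with no physical support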
In a first aspect, a method includes, at a node of a Hadoop cluster, the node storing a first portion of data in HDFS data storage, executing a first instance of a data processing engine capable of receiving data from a data source external to the Hadoop cluster, receiving a computer-executable program by the data processing engine, executing at least part of the program by the first instance of the data processing engine, receiving, by the data processing engine, a second portion of data from the external data source, storing the second portion of data other than in HDFS storage, and performing, by the data processing engine, a data processing operation identified by the program using at least the first portion of data and the second portion of data.
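Much simplified, and without any real Hadoop or HDFS APIs, the engine's combining step could be sketched as an enrichment join between node-local data and externally fetched data held only in memory; every function name below is hypothetical.

def read_hdfs_portion():
    # stands in for the first portion of data stored in HDFS on this node
    return [{"id": 1, "amount": 10}, {"id": 2, "amount": 20}]

def fetch_external_portion():
    # stands in for data received from a source external to the cluster,
    # kept in memory rather than written to HDFS storage
    return {1: "gold", 2: "silver"}

def run_program(hdfs_rows, external_lookup):
    # the data processing operation identified by the program: an enrichment join
    return [dict(row, tier=external_lookup[row["id"]]) for row in hdfs_rows]

print(run_program(read_hdfs_portion(), fetch_external_portion()))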
A method performed by a computer system includes: accessing a specification that specifies a plurality of modules to be implemented by a computer program for processing one or more values of one or more fields in a structured data item; transforming the specification into the computer program that implements the plurality of modules, wherein the transforming includes: for each of one or more first modules of the plurality of modules: identifying one or more second modules of the plurality of modules that each receive input that is at least partly based on an output of the first module; and formatting an output data format of the first module such that the first module outputs only one or more values of one or more fields of the structured data item.
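One way to picture the output-format narrowing is the sketch below, with invented module and wiring structures: each module's output fields are trimmed to those its downstream consumers actually read.

modules = {
    "parse":  {"outputs": ["id", "name", "debug_blob"], "inputs": []},
    "score":  {"outputs": ["id", "score"],              "inputs": ["id", "name"]},
    "report": {"outputs": ["line"],                     "inputs": ["id", "score"]},
}
wiring = {"parse": ["score"], "score": ["report"], "report": []}  # module -> downstream modules

for name, mod in modules.items():
    consumers = wiring[name]                 # second modules fed by this first module
    needed = {f for c in consumers for f in modules[c]["inputs"]}
    if consumers:                            # format output to only the needed fields
        mod["outputs"] = [f for f in mod["outputs"] if f in needed]

print(modules["parse"]["outputs"])   # ['id', 'name'] -- 'debug_blob' is dropped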
Systems and methods are described for executing, by a data processing system, a workflow to process results data indicating an output of a data quality test on data records by generating, responsive to receiving the results data and metadata describing the results data, a data quality issue associated with a state and one or more processing steps of the workflow to resolve a data quality error associated with the data quality test. Operations include generating a workflow for processing results data based on a state specified by a data quality issue. Generating the workflow includes: assigning, based on the results data and the state of the data quality issue, an entity responsible for resolving the data quality error; determining, based on the metadata, one or more actions for satisfying the data quality condition specified in the data quality test; and updating the state associated with the data quality issue.
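A hypothetical sketch of turning test results into a stateful issue might look as follows; the result fields, metadata keys, and state names are invented.

results = {"test": "null_check", "failed_rows": 42}
metadata = {"dataset": "orders", "owner_team": "finance-data"}

issue = {"state": "new", "error": results["test"], "assignee": None, "actions": []}

# assign a responsible entity based on the results and the issue's current state
if issue["state"] == "new" and results["failed_rows"] > 0:
    issue["assignee"] = metadata["owner_team"]
    issue["state"] = "assigned"

# determine actions that would satisfy the tested condition, using the metadata
issue["actions"] = [f"backfill nulls in {metadata['dataset']}",
                    f"re-run {results['test']}"]
issue["state"] = "in_progress"   # update the state associated with the issue

print(issue)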
Techniques for managing access privileges in a data processing system include obtaining a plurality of rules for granting and/or denying privileges to a first actor to perform at least one action on a first instance of a first data entity of data entities; identifying, from among attributes of the first data entity, a first attribute whose values are used by one or more of the plurality of rules; obtaining, from a user or from at least one data store, a first value of the first attribute; identifying, using the first value and from among the plurality of rules, a first rule that depends on the first value; generating a graphical user interface (GUI) including a visual rendering of at least some of the plurality of rules, the visual rendering emphasizing the first rule identified using the first value of the first attribute; and displaying the generated GUI to the user.
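The rule-identification step could be sketched as below, with invented rules and attributes; the printed markers stand in for the GUI's emphasized rendering.

rules = [
    {"name": "deny-restricted", "attribute": "region", "value": "EU", "effect": "deny"},
    {"name": "allow-analysts",  "attribute": "role", "value": "analyst", "effect": "grant"},
]

attribute, value = "region", "EU"   # first attribute and its obtained first value

for rule in rules:                  # identify the rule that depends on this value
    rule["emphasized"] = (rule["attribute"] == attribute and rule["value"] == value)

for rule in rules:                  # a text stand-in for the visual rendering
    marker = ">>" if rule["emphasized"] else "  "
    print(f"{marker} {rule['name']}: {rule['effect']} when {rule['attribute']}={rule['value']}")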
A method is described for processing keyed data items that are each associated with a value of a key, the keyed data items being from a plurality of distinct data streams, the processing including collecting the keyed data items, determining, based on contents of at least one of the keyed data items, satisfaction of one or more specified conditions for execution of one or more actions and causing execution of at least one of the one or more actions responsive to the determining.
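A minimal sketch, with invented streams, condition, and action, of collecting keyed items across streams and firing an action once a per-key condition is satisfied:

from collections import defaultdict

stream_a = [{"key": "u1", "event": "login"}, {"key": "u2", "event": "login"}]
stream_b = [{"key": "u1", "event": "purchase"}]

collected = defaultdict(list)
for item in stream_a + stream_b:   # collect keyed items from distinct streams
    collected[item["key"]].append(item["event"])

def condition(events):             # specified condition on the items' contents
    return "login" in events and "purchase" in events

for key, events in collected.items():
    if condition(events):          # cause execution of the action
        print(f"action: send receipt to {key}")   # fires only for u1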
Techniques for storing data entities by a data processing system are described herein. The data processing system may store a plurality of data entity instances generated using a plurality of data entities. The plurality of data entity instances may include a first data entity instance generated using a first data entity and a second data entity instance generated using a second data entity. The first data entity instance may include a first attribute that is configured to inherit its value from a second attribute of the second data entity instance. The data processing system may provide the inherited value of the second attribute of the second data entity instance as the value of the first attribute of the first data entity instance.
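One way the inheritance might be modeled, using an invented Instance class, is to record for each inheriting attribute which other instance and attribute supply its value, and to resolve through that link on read:

class Instance:
    def __init__(self, attrs, inherits=None):
        self.attrs = attrs
        self.inherits = inherits or {}   # local attr -> (source instance, source attr)

    def get(self, name):
        if name in self.attrs:
            return self.attrs[name]
        src, src_attr = self.inherits[name]
        return src.get(src_attr)         # provide the inherited value

dept = Instance({"cost_center": "CC-42"})            # second data entity instance
employee = Instance({"name": "Ada"},                 # first data entity instance
                    inherits={"cost_center": (dept, "cost_center")})

print(employee.get("cost_center"))   # 'CC-42', inherited from dept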
A data processing system that receives user input specifying datasets on which operations are performed with user interfaces that enable manipulation of hierarchical groups of datasets. A user interface may enable individual datasets or a previously defined group of datasets to be aggregated into another grouping. The groupings may be scoped, including by persona of users, such that, when a user is prompted to specify one or more datasets as a target of an operation by the data processing system, the available choices are limited to datasets that have a scope encompassing that user. The interfaces may prompt a user to select a grouping within the hierarchy that contains datasets on which the operation can be performed. Upon selection of a grouping with multiple datasets as a target of an operation that is performed on datasets singly, the operation may be performed on each dataset in the selected group.
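The scoping and group-wide application could be sketched as follows, with invented groups, personas, and a placeholder operation:

groups = {
    "finance-core":  {"scope": {"analyst", "admin"}, "datasets": ["gl", "ap"]},
    "hr-restricted": {"scope": {"admin"}, "datasets": ["payroll"]},
}

def choices_for(persona):
    """Groups a user may pick: only those whose scope encompasses the persona."""
    return [g for g, info in groups.items() if persona in info["scope"]]

def apply_to_group(group, operation):
    """An operation defined on single datasets, applied to each group member."""
    return [operation(ds) for ds in groups[group]["datasets"]]

print(choices_for("analyst"))                          # ['finance-core']
print(apply_to_group("finance-core", lambda ds: f"profiled {ds}"))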
A data processing system with a dataset multiplexer that enables applications to be written to specify access to datasets as operations on logical datasets. During execution of an application by the data processing system, operations that access a dataset are implemented by accessing an entry in a dataset catalog for the logical dataset. That entry includes information to access the physical data source storing the logical dataset, including conversion of data from the format of the physical data source to the format of the logical dataset. An entry in the catalog may be created based on registration of a data source with the dataset multiplexer and may be updated automatically based on changes in storage of the dataset. This maintenance of the catalog may be partially or totally automated such that the system automatically adjusts to any changes in storage of the dataset without need for modification of any application.
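A sketch of resolving a logical dataset through such a catalog entry, with an invented catalog shape and an in-memory stand-in for physical storage, might read:

import csv, io

catalog = {
    "customers": {
        "location": "customers.csv",   # physical data source for the logical dataset
        "convert": lambda raw: list(csv.DictReader(io.StringIO(raw))),
    },
}

physical_store = {"customers.csv": "id,name\n1,Ada\n2,Grace\n"}

def read_logical(name):
    entry = catalog[name]              # the application sees only the logical name
    raw = physical_store[entry["location"]]
    return entry["convert"](raw)       # physical format -> logical format

print(read_logical("customers"))       # [{'id': '1', 'name': 'Ada'}, ...]

If the dataset moves or changes format, only the catalog entry is updated; applications calling read_logical("customers") need no modification, which is the point of routing access through the multiplexer.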
Techniques for obtaining information about data entity instances managed by a data processing system using at least one data store. The techniques include obtaining a query comprising a first portion comprising information for identifying instances of a first data entity stored in at least one data store; and a second portion indicating at least one attribute of the first data entity; generating, from the query, a plurality of executable queries including a first set of one or more executable queries and a second set of one or more executable queries, the generating comprising: generating, using the first portion, the first set of executable queries for identifying instances of the first data entity, and generating, using the second portion, the second set of executable queries for obtaining attribute values for instances of the first data entity; and executing the plurality of executable queries to obtain results for the query.
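The split into identifying queries and attribute-fetching queries could be sketched as below; the query shape and the two stand-in stores are invented for illustration.

store_ids = {"Customer": {"status='active'": ["C1", "C3"]}}   # stand-in data store
store_attrs = {("C1", "risk"): "low", ("C3", "risk"): "high"}

query = {"entity": "Customer", "filter": "status='active'", "attributes": ["risk"]}

# first set: executable query that identifies matching entity instances
ids = store_ids[query["entity"]][query["filter"]]

# second set: executable queries that fetch attribute values per instance
results = [{"id": i, **{a: store_attrs[(i, a)] for a in query["attributes"]}} for i in ids]

print(results)   # [{'id': 'C1', 'risk': 'low'}, {'id': 'C3', 'risk': 'high'}]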