A method for identifying zones of text in a digital document, including identifying one or more vertical chains of nodes, classifying horizontal spaces between horizontally aligned text objects in the digital document, combining one or more text objects into a segmented horizontal line based on the one or more vertical chains and the classification of horizontal spaces between the one or more text objects, identifying one or more intermediate vertical chains of nodes, refining the segmented horizontal line based on the one or more intermediate vertical chains, and identifying a zone of text based on the one or more vertical chains of nodes and the segmented horizontal line.
A method, apparatus, and non-transitory computer-readable storage medium for altering a digital image for a printing job, the method comprising receiving a requested printing job including the digital image, performing a segmentation on the digital image, extracting values of properties for a segment of the segmented digital image, determining, based on the extracted values of the properties for the segment, whether the segment of the digital image should be altered, and altering the segment of the digital image when it is determined the segment of the digital image should be altered, a resulting altered digital image being transmitted to a printer for printing.
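The segment, measure, decide, alter sequence described above can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the abstract: the image is a grayscale list of rows, segmentation is a simple split into horizontal bands, the extracted property is mean intensity, and the alteration lightens bands that are darker than a hypothetical `dark_threshold`.

```python
def segment_rows(image, band_height):
    """Split the image into horizontal bands (a stand-in for real segmentation)."""
    return [(top, min(top + band_height, len(image)))
            for top in range(0, len(image), band_height)]


def mean_intensity(image, band):
    """Extract one property value for a segment: its mean pixel intensity."""
    top, bottom = band
    pixels = [p for row in image[top:bottom] for p in row]
    return sum(pixels) / len(pixels)


def alter_for_printing(image, band_height=2, dark_threshold=64, lift=40):
    """Return a copy of the image in which only the segments judged too dark
    are lightened. The thresholds are hypothetical values."""
    result = [row[:] for row in image]
    for band in segment_rows(image, band_height):
        # Decide per segment, based on the extracted property value.
        if mean_intensity(image, band) < dark_threshold:
            top, bottom = band
            for r in range(top, bottom):
                result[r] = [min(255, p + lift) for p in image[r]]
    return result


page = [[10, 20], [30, 40], [200, 210], [220, 230]]
altered = alter_for_printing(page)
```

In this example only the first band (mean intensity 25) falls below the threshold, so only its pixels change; the altered copy is what would be handed on to the printer.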
A method, computer-readable medium, and apparatus for recognizing a character zone in a digital document. In an embodiment, the method includes classifying a segment of the digital document as including text, calculating at least one parameter value associated with the classified segment of the digital document, determining, based on the calculated at least one parameter value, a zonal parameter value, classifying the segment of the digital document as a handwritten text zone or as a printed text zone based on the determined zonal parameter value and a threshold value, the threshold value being based on a selection of an intersection of a handwritten text distribution profile and a printed text distribution profile, each of the handwritten text distribution profile and the printed text distribution profile being associated with a zonal parameter corresponding to the determined zonal parameter value, and generating, based on the classifying, a modified version of the digital document.
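One way to picture a threshold chosen at the intersection of two distribution profiles is the sketch below. It assumes, purely for illustration, that both profiles are Gaussian, that the zonal parameter value is the mean of the per-segment parameter values, and that the crossing point is found by bisection between the two means. The function names and the example numbers are hypothetical.

```python
import math


def gaussian_pdf(x, mean, std):
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2 * math.pi))


def intersection_threshold(mean_a, std_a, mean_b, std_b, iterations=60):
    """Find the point between the two means where the profiles cross."""
    (lo, lo_std), (hi, hi_std) = sorted([(mean_a, std_a), (mean_b, std_b)])

    def diff(x):
        return gaussian_pdf(x, lo, lo_std) - gaussian_pdf(x, hi, hi_std)

    # Bisection needs a sign change between the two means.
    if not (diff(lo) > 0 > diff(hi)):
        raise ValueError("profiles do not cross between their means")
    left, right = lo, hi
    for _ in range(iterations):
        mid = (left + right) / 2
        if diff(mid) > 0:
            left = mid
        else:
            right = mid
    return (left + right) / 2


def classify_zone(parameter_values, threshold, handwritten_above=True):
    """Reduce the parameter values to one zonal value (assumed: their mean)
    and compare it with the threshold."""
    zonal_value = sum(parameter_values) / len(parameter_values)
    is_above = zonal_value > threshold
    return "handwritten" if is_above == handwritten_above else "printed"


# Hypothetical profiles: handwritten centred at 0.8, printed at 0.2.
threshold = intersection_threshold(0.8, 0.1, 0.2, 0.1)
label = classify_zone([0.6, 0.8], threshold)
```

With equal spreads the crossing point lies midway between the two means, so the example threshold is approximately 0.5 and a zone whose zonal value is 0.7 is labelled handwritten.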
Described herein are an apparatus, method, and computer-readable medium. The apparatus includes processing circuitry configured to extract a textual content included within a digital document, perform a text search using the extracted textual content on an indexed master document database to identify one or more master documents that are similar, within a pre-determined threshold, to the digital document, generate a candidate master document list using the one or more master documents identified based on the text search, extract a plurality of features of the digital document, perform a comparison, after performing the text search, of the plurality of features of the digital document with features of the one or more master documents in the candidate master document list, and identify a master document of the one or more master documents that matches the digital document based on the comparison of the features.
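The two-stage matching described above (a text search that narrows the database to a candidate list, followed by a feature comparison over that list only) can be sketched as follows. This is a minimal illustration under stated assumptions: Jaccard similarity over word sets stands in for the indexed text search, Euclidean distance stands in for the feature comparison, and `MasterDocument`, `candidate_list`, and `match_master` are hypothetical names.

```python
import math
from dataclasses import dataclass


@dataclass
class MasterDocument:
    name: str
    text: str
    features: tuple


def jaccard(a, b):
    """Word-set overlap, used here as a stand-in for an indexed text search."""
    sa, sb = set(a.lower().split()), set(b.lower().split())
    if not sa and not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)


def candidate_list(text, masters, min_similarity):
    """Stage 1: keep only masters whose text is similar enough."""
    return [m for m in masters if jaccard(text, m.text) >= min_similarity]


def match_master(text, features, masters, min_similarity=0.5):
    """Stage 2: compare features against the candidates only."""
    candidates = candidate_list(text, masters, min_similarity)
    if not candidates:
        return None
    return min(candidates, key=lambda m: math.dist(features, m.features))


masters = [
    MasterDocument("invoice", "invoice number date total amount due", (1.0, 0.0)),
    MasterDocument("invoice_v2", "invoice number date total amount paid", (0.0, 1.0)),
    MasterDocument("form", "patient name date of birth", (1.0, 0.0)),
]
best = match_master("invoice number date total amount due", (0.1, 0.9), masters)
```

Here the text search keeps both invoice masters and drops the form; the feature comparison then selects `invoice_v2`, whose features are closest to the query. Running the cheap text search first keeps the more expensive feature comparison limited to a short list.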
42 - Scientific, technological and industrial services, research and design
Goods & Services
Computer consultation in the field of medicine and dentistry; Computer software development and computer programming development for others; Computer software development in the field of digital imaging in the medical and dental fields; Computer software development, computer programming and maintenance of computer software for the medical and dental fields; Design and development of computer software for the medical and dental fields; Design, development, and implementation of software for the medical and dental fields
9.
Apparatus, method, and computer-readable storage medium for determining a rotation angle of text
An apparatus, method, and computer-readable storage medium for determining a rotation angle of text. The method includes computing, for each object of a plurality of objects included in text within an image, a distance to a closest neighboring object, computing an average distance of the distances to the closest neighboring objects, determining a ratio between the average distance and an average font stroke width, the average font stroke width being an average of a font stroke width of each of the plurality of objects, and determining a rotation angle of the text by comparing the ratio to a threshold value.
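The ratio test described above can be sketched in a few lines. The sketch assumes that each object is represented by a centroid and a measured stroke width, and that distances are measured between centroids. The abstract does not say which angle corresponds to which side of the threshold, so the two candidate angles are left as hypothetical parameters (`angle_if_below`, `angle_if_above`).

```python
import math
from dataclasses import dataclass


@dataclass
class TextObject:
    x: float             # centroid x (assumed representation)
    y: float             # centroid y
    stroke_width: float  # font stroke width measured for this object


def nearest_neighbor_distances(objects):
    """For each object, the distance to its closest neighboring object."""
    return [
        min(math.hypot(a.x - b.x, a.y - b.y)
            for j, b in enumerate(objects) if j != i)
        for i, a in enumerate(objects)
    ]


def spacing_to_stroke_ratio(objects):
    """Average nearest-neighbor distance divided by average stroke width."""
    if len(objects) < 2:
        raise ValueError("need at least two objects")
    distances = nearest_neighbor_distances(objects)
    avg_distance = sum(distances) / len(distances)
    avg_stroke = sum(o.stroke_width for o in objects) / len(objects)
    return avg_distance / avg_stroke


def choose_rotation_angle(objects, threshold,
                          angle_if_below=0.0, angle_if_above=90.0):
    """Pick one of two candidate angles by comparing the ratio to a threshold."""
    ratio = spacing_to_stroke_ratio(objects)
    return angle_if_below if ratio < threshold else angle_if_above


glyphs = [TextObject(0, 0, 2), TextObject(10, 0, 2), TextObject(20, 0, 2)]
ratio = spacing_to_stroke_ratio(glyphs)
```

For the three example objects every nearest neighbor is 10 units away and the average stroke width is 2, so the ratio is 5.0. Dividing by stroke width makes the comparison largely independent of font size and scan resolution.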
G06K 9/36 - Image preprocessing, i.e. processing the image information without deciding about the identity of the image
G06K 9/32 - Aligning or centering of the image pick-up or image-field
G06K 9/18 - Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints using printed characters having additional code marks or containing code marks, e.g. the character being composed of individual strokes of different shape, each representing a different code value
G06K 9/34 - Segmentation of touching or overlapping patterns in the image field
09 - Scientific and electric apparatus and instruments
Goods & Services
[ computer hardware, printed circuit boards, ] computer software for performing data compression, for use in the field of communication, for performing computer imaging and for use in driving computer printers
09 - Scientific and electric apparatus and instruments
Goods & Services
computer hardware, printed circuit boards, computer software for performing data compression, for use in the field of communication, for performing computer imaging and for use in driving computer printers