-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathclassification.html
More file actions
254 lines (230 loc) · 11.9 KB
/
classification.html
File metadata and controls
254 lines (230 loc) · 11.9 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
<!DOCTYPE html>
<html>
<head>
<meta charset="utf-8"/>
<meta http-equiv="X-UA-Compatible" content="IE=edge;" />
<meta http-equiv="Cache-Control" content="no-store, must-revalidate" />
<meta name="viewport" content="width=device-width, initial-scale=1" />
<title>Document Classification with Grooper</title>
<link rel="stylesheet" href="styles/font-awesome/web-fonts-with-css/css/fontawesome-all.min.css">
<link rel="stylesheet" href="styles/main.css">
<link href="https://fonts.googleapis.com/css?family=Roboto:100,100i,300,300i,400,400i,500,500i,700,700i,900,900i" rel="stylesheet">
<script src="https://ajax.googleapis.com/ajax/libs/jquery/3.2.1/jquery.min.js"></script>
<script id="mcjs">!function(c,h,i,m,p){m=c.createElement(h),p=c.getElementsByTagName(h)[0],m.async=1,m.src=i,p.parentNode.insertBefore(m,p)}(document,"script","https://chimpstatic.com/mcjs-connected/js/users/2e3eeca20c1420746bffa37d4/96f5c58e37a4abf4a30ce18a4.js");</script>
<script async src="https://www.googletagmanager.com/gtag/js?id=UA-101038961-2"></script>
<script>
window.dataLayer = window.dataLayer || [];
function gtag(){dataLayer.push(arguments);}
gtag('js', new Date());
gtag('config', 'UA-101038961-2');
</script>
</head>
<body>
<nav id="nav_primary">
<a href="./index.html"></a>
<input id="checkbox_click" type="checkbox" onclick="noScroll()"/>
<i class="fa fa-bars"></i>
<i class="fa fa-times"></i>
<ul>
<li><a href="./features.html">Features</a></li>
<li><a href="./roadmap.html">Roadmap</a></li>
<!--
<li><a href="./pricing.html">Pricing</a></li>
<li><a href="./training.html">Training</a></li>
-->
<li><a href="./support.html">Support</a></li>
<li><a href="./contact.html">Contact</a></li>
<li><a href="http://xchange.grooper.com">Grooper x Change</a></li>
</ul>
<div class="social">
<a href="https://www.facebook.com/BusinessImagingSystems"><i class="fab fa-facebook"></i></a>
<a href="https://plus.google.com/u/0/b/115220706750340433625/+Bisok/posts"><i class="fab fa-google-plus"></i></a>
<a href="https://twitter.com/BIS_Tweets"><i class="fab fa-twitter"></i></a>
<a href="https://www.linkedin.com/company/business-imaging-systems?trk=prof-exp-company-name"><i class="fab fa-linkedin"></i></a>
<a href="https://www.youtube.com/channel/UCiJPKqS_enHrFsX49cngqag"><i class="fab fa-youtube"></i></a>
</div>
</nav>
<header>
<canvas id="header_canvas"></canvas>
<h1>Automate Document Classification</h1>
<p>Manually selecting doc types is a thing of the past. Quickly regain control of sprawling, unstructured document collections by automatically organizing them into logical groups based on similarity rankings in models that <strong>you train and control.</strong></p>
</header>
<div id="nav_secondary">
<nav id="nav_secondary_container">
<div id="nav_secondary_contents" class="nav_secondary_contents">
<a href="paper-capture.html">Modern Paper Capture</a>
<a href="electronic-documents.html">Electronic Document Processing</a>
<a href="image-optimization.html">Image Optimization</a>
<a href="ocr.html">Synthetic OCR</a>
<!--<a href="atomic-regex.html">Atomic RegEx</a>-->
<a href="classification.html" aria-selected="true">Document Classification</a>
<a href="natural-language-processing.html">Natural Language Processing (NLP)</a>
<a href="design-studio.html">Design Studio</a>
<a href="modern-architecture.html">Modern Architecture</a>
<span id="nav_secondary_indicator"></span>
</div>
</nav>
<button id="nav_secondary_left" type="button">
<svg class="nav_advancer_icon" xmlns="http://www.w3.org/2000/svg" viewBox="0 0 551 1024"><path d="M445.44 38.183L-2.53 512l447.97 473.817 85.857-81.173-409.6-433.23v81.172l409.6-433.23L445.44 38.18z"/></svg>
</button>
<button id="nav_secondary_right" type="button">
<svg class="nav_advancer_icon" xmlns="http://www.w3.org/2000/svg" viewBox="0 0 551 1024"><path d="M105.56 985.817L553.53 512 105.56 38.183l-85.857 81.173 409.6 433.23v-81.172l-409.6 433.23 85.856 81.174z"/></svg>
</button>
</div>
<main>
<section class="col-3 border-img shadow_img">
<h2>Feature Collection with ESP</h2>
<p>Grooper's ESP engine identifies the distinguishing features of each page to group collections of images together as classified documents. ESP uses three key feature collection mechanisms:</p>
<figure>
<figcaption>
<h3>Lexical</h3>
<p>NLP examines the <strong>language</strong> of the complete document to understand its meaning and determine what it is.</p>
</figcaption>
<img src="images/features/classification/grooper-classification-lexical.jpg" />
</figure>
<figure>
<figcaption>
<h3>Rules-Based</h3>
<p>Finds unique <strong>key words or phrases</strong> that positively identify a document, like a Title or Section Heading.</p>
</figcaption>
<img src="images/features/classification/grooper-classification-rules-based.jpg" />
</figure>
<figure>
<figcaption>
<h3>Visual</h3>
<p><strong>Computer vision</strong> identifies structured forms based on what they look like without having to read from OCR.</p>
</figcaption>
<img src="images/features/classification/grooper-classification-computer-vision.jpg" />
</figure>
</section>
<section class="col-1 side_by_side figure-large">
<h2>ESP Separation</h2>
<p>Train document examples and see how the ESP Separation engine interprets the content of each page, <em>Groops</em> pages into documents, and simulates page breaking and classification.
<figure>
<img class="img-lrg" src="images/features/design-studio/grooper-design-studio-esp-separation-testing.jpg" />
<figcaption>
<ul>
<li>A simple "train-by-example" interface lets you quickly teach ESP how to identify each document.</li>
<li>Real-time confidence scores show you both the document type and assumed page number for each page in a batch.</li>
<li>Estimated Page Index (EPI) identfies page numbers on your documents. This information is used by ESP to determine if an unknown page is likely part of a surrounding document.</li>
</ul>
</figure>
</section>
<section class="col-1 figure-large">
<figure>
<figcaption>
<h2>Classification</h2>
<p>Provide document examples and watch is Grooper begins to learn the correct Doc Type for each instrument provided. When doing batch testing, unclassified items (those with low confidence scores) can be flagged and sent to a queue for additional training.</p>
</figcaption>
<img class="img-lrg" src="images/features/design-studio/grooper-design-studio-classification-review-testing.jpg" />
</figure>
</section>
<!--
<section class="col-1 side_by_side align_top align_bottom align_right">
<figure>
<figcaption>
<h2>Classification</h2>
<p>Performs automated separation and classification of loose pages in a batch.</p>
<p>Handles complex document separation without the need for separator sheets.</p>
<p>Training-based approach provides simple, fast implementation.</p>
</figcaption>
<img src="images/features/classification/grooper-esp-separation-before-and-after-2.jpg">
</figure>
</section>
-->
</main>
<div id="prev_next">
<div>
<!--<a href="./atomic-regex.html">-->
<a href="./ocr.html">
<i class="fa fa-arrow-left" aria-hidden="true"></i>
<span class="direction">Previous</span>
<span>Synthetic OCR</span>
</a>
</div>
<div>
<a href="./natural-language-processing.html">
<i class="fa fa-arrow-right" aria-hidden="true"></i>
<span class="direction">Next</span>
<span>Natural Language Processing</span>
</a>
</div>
</div>
<footer>
<section id="schedule_demo" class="col-1 bg-dark gray">
<a class="btn-lrg" href="https://goo.gl/forms/kwEe075qCFCWIl2X2" target="_blank">Schedule Demo</a>
<h2></h2>
<p>Meet with one of our Grooper Gurus to ask questions you have about Grooper's functionality and pricing model. Then, see a live presentation that demonstrates Grooper's uniqueness, the end-user experience, and how the magic works behind the scenes. No "smoke and mirrors" here.</p>
</section>
<div id="footer_links">
<div id="contact">
<h4>Contact</h4>
<ul>
<li><a href="mailto:info@grooper.com">info@grooper.com</a></li>
<li><a href="tel:1-800-408-5668">Support: 1-800-408-5668</a></li>
<li><a href="tel:1-405-507-7000">Sales: 1-405-507-7000</a></li>
<li>13900 N. Harvey Avenue</li>
<li>Edmond, OK 73013</li>
</ul>
</div>
<div id="discover">
<h4>Discover</h4>
<ul>
<li><a href="./features.html">Features</a></li>
<li><a href="./roadmap.html">Roadmap</a></li>
<!--
<li><a href="./pricing.html">Plans & Pricing</a></li>
<li><a href="./training.html">Training Program</a></li>
-->
<li><a href="http://xchange.grooper.com">Grooper x Change</a></li>
<li><a href="./support.html">Get Support</a></li>
<li><a href="./contact.html">Contact Us</a></li>
</ul>
</div>
<div id="signup">
<h4>Newsletter</h4>
<form action="https://bisok.us2.list-manage.com/subscribe/post?u=2e3eeca20c1420746bffa37d4&id=98680ff514" method="post" id="mc-embedded-subscribe-form" name="mc-embedded-subscribe-form" class="validate" target="_blank" novalidate>
<div id="mc_embed_signup_scroll">
<div class="mc-field-group">
<!--<label for="mce-EMAIL">Email Address </label>-->
<input type="email" value="" name="EMAIL" class="required email" id="mce-EMAIL" placeholder="Email Address">
</div>
<div class="mc-field-group">
<!--<label for="mce-FNAME">First Name </label>-->
<input type="text" value="" name="FNAME" class="" id="mce-FNAME" placeholder="First Name">
</div>
<div class="mc-field-group">
<!--<label for="mce-LNAME">Last Name </label>-->
<input type="text" value="" name="LNAME" class="" id="mce-LNAME" placeholder="Last Name">
</div>
<div class="mc-field-group">
<!--<label for="mce-COMPANY">Company </label>-->
<input type="text" value="" name="COMPANY" class="" id="mce-COMPANY" placeholder="Company">
</div>
<div id="mce-responses" class="clear">
<div class="response" id="mce-error-response" style="display:none"></div>
<div class="response" id="mce-success-response" style="display:none"></div>
</div> <!-- real people should not fill this in and expect good things - do not remove this or risk form bot signups-->
<div style="position: absolute; left: -5000px;" aria-hidden="true"><input type="text" name="b_2e3eeca20c1420746bffa37d4_98680ff514" tabindex="-1" value=""></div>
<div class="clear content__button"><input type="submit" value="Subscribe" name="subscribe" id="mc-embedded-subscribe" class="button"></div>
</div>
</form>
</div>
</div>
<div class="social">
<a href="https://www.facebook.com/BusinessImagingSystems"><i class="fab fa-facebook"></i></a>
<a href="https://plus.google.com/u/0/b/115220706750340433625/+Bisok/posts"><i class="fab fa-google-plus"></i></a>
<a href="https://twitter.com/BIS_Tweets"><i class="fab fa-twitter"></i></a>
<a href="https://www.linkedin.com/company/business-imaging-systems?trk=prof-exp-company-name"><i class="fab fa-linkedin"></i></a>
<a href="https://www.youtube.com/channel/UCiJPKqS_enHrFsX49cngqag"><i class="fab fa-youtube"></i></a>
</div>
<div id="copyright">
<span>©2018 Grooper, LLC. All Rights Reserved.</span>
<img src="images/g_symbol_64.png" />
</div>
</footer>
<script src="js/animate-header.js"></script>
<script src="js/nav-primary.js"></script>
<script src="js/nav-secondary.js"></script>
</body>
</html>