Grooper 21.00.0082 is available as of 12-12-2023! Check the Downloads Discussion for the release notes and to get the latest version.
Grooper 23.1.0016 is available as of 03-15-2024! Check the Downloads Discussion for the release notes and to get the latest version.
Grooper 23.00.0042 is available as of 03-22-2024! Check the Downloads Discussion for the release notes and to get the latest version.
Looking to include a line of text in my value pattern, ending in a comma
opus
Posts: 21 ✭
Can someone help me understand why my value pattern in not picking up a line in my image? I've tried everything I can think of:
thanks in advance.
Darin
thanks in advance.
Darin
0
Best Answers
-
GrooperGuru Posts: 481 adminWhat you have here is actually an OCR synthesis issue. This is a pretty complex part of logic in the software and it has undergone a number of changes over the years. If you're document is deskewed well (and yours appears to be), then this really shouldn't have happened. It would be nice if you could send a copy of the OCR profile you used and the original document over to our support team for analysis.
But as for your core problem, based on the pattern and OCR result you posted above, it shouldn't be returning a result.
Let's start at the "SHL" in your OCR result. According to your lookahead, we can anchor off the \n character before "SHL", so that's an acceptable start. Then getting into your value pattern, the "S" is acceptable as it is not a "\s" or ":". At this point, it is going to continue consuming any character until it gets to a [\t\f:=], which means it will stop when it gets to the "L". So you have now captured up to "SHL". The problem now is that, in order for that to be a valid result, it must be followed immediately by a [\t\f\r|$. And it isn't. I'ts just a single space right there, so the entire result is not valid. Now the pattern tries to start over at the next character position looking for your look ahead, but there isn't another character anywhere on that line that is a [\t\f\n:=]|^, so you get no results. The solution here, as Josh mentioned, is to change your value pattern to be this: [^\s:][^\t\f=]*
The only different is removing the ":" from the second character set. That should allow the pattern to go to the rest of the line as you are probably expecting it to.Matt Harrison
Product Manager
mharrison@bisok.com5
Answers
LA - [\t\f\n:=]|^
LB - [\t\f\r]|$
VP - [^\s:][^\t\f:=]*
How can we account for that?