Grooper 21.00.0082 is available as of 12-12-2023! Check the Downloads Discussion for the release notes and to get the latest version.
Grooper 23.00.0044 is available as of 06-20-2024! Check the Downloads Discussion for the release notes and to get the latest version.
Grooper 23.1.0026 is available as of 09-16-2024! Check the Downloads Discussion for the release notes and to get the latest version.
Grooper 24.0.0013 is available as of 10-04-2024! Check the Downloads Discussion for the release notes and to get the latest version.
Extract Table data when few lines of text between the Table Header & Data first Row
ramesh
Posts: 6 ✭
Hi,
I am facing difficulty while extracting data using Table section Key Value Pair list where it contains few lines of text between the Header section & Start of the data row.
1. Header rows comes only one time per page
2. Some text between Header & first Data Row(s) (first set)
3. There will be some text between next set of Data Row(s)
Like below:
-----------------------------------------------
HCol1 HCol2 HCol3 HCol4
-----------------------------------------------
Line 1
Line 2
Line 3
Data1 Data2 Data3 Data4
Data1 Data2 Data3 Data4
Data1 Data2 Data3 Data4
Data1 Data2 Data3 Data4
-----------------------------------------------
Footer Text
-----------------------------------------------
other text 1
other text 2
other text3
Data1 Data2 Data3 Data4
Data1 Data2 Data3 Data4
Data1 Data2 Data3 Data4
Data1 Data2 Data3 Data4
-----------------------------------------------
Footer Text
-----------------------------------------------
I am facing difficulty while extracting data using Table section Key Value Pair list where it contains few lines of text between the Header section & Start of the data row.
1. Header rows comes only one time per page
2. Some text between Header & first Data Row(s) (first set)
3. There will be some text between next set of Data Row(s)
Like below:
-----------------------------------------------
HCol1 HCol2 HCol3 HCol4
-----------------------------------------------
Line 1
Line 2
Line 3
Data1 Data2 Data3 Data4
Data1 Data2 Data3 Data4
Data1 Data2 Data3 Data4
Data1 Data2 Data3 Data4
-----------------------------------------------
Footer Text
-----------------------------------------------
other text 1
other text 2
other text3
Data1 Data2 Data3 Data4
Data1 Data2 Data3 Data4
Data1 Data2 Data3 Data4
Data1 Data2 Data3 Data4
-----------------------------------------------
Footer Text
-----------------------------------------------
Tagged:
0
Best Answer
-
jclark Posts: 60 ✭✭✭Hello,
Please see the Grooper Wiki Article that should help explain the data collection issue you had asked about above.
https://wiki.grooper.com/index.php?title=Row_Match_(Table_Extract_Method)#Use_Cases:_Deep_Dive
Please let us know if you have any questions or comments.5
Answers
Please let us know if this method resolves your issue.
Thank you for the help. I tried with above instruction, but I am unable to extract the data.
I am attaching sample image, hope this will help you in understanding the format.
1. Header comes only once at the beginning of the page, Header will not repeat for each table
2. Page can contain multiple Table (As shown in the Picture).
3. In a Table , a Row Only 1st, 2nd & 3rd Columns will always have the Values. Other columns may not contain values for all the Columns (column values missed randomly),
4. Some times , a Row may expand to 2nd line also.
5. The distance between the Page Border & Row start will differ from image to image.
6. I am using IP profile to clean up Table(lines), background color that row contains.
I do have examples of this Highmark Blueshield format for testing with on our side. I do know that our development team was working on issues with these types of formats for a future version update but I am not sure where that is at currently. I will look at the examples I have to see if I can give you a solution for this type of format with the current version of Grooper.
We are currently working on a Grooper Wiki Article to give detailed instructions for this format. We expect it to be completed early next week and will give an update when it is finished and ready for viewing.
I will try the way to extract the data.