Document separation by value

DungVu Posts: 51

Our process currently has a step that separates documents on a change in value of the invoice number. In the screenshot above, page #2 is actually an invoice, but its invoice number was not detected, so the page was appended to the previous folder. We tried setting up a Multi-Separator that adds an event-based separator with a page count of 1, but that doesn't work either: it puts the loose page in its own folder for classification, but it also splits the multi-page invoices into single-page folders. Please give us some direction on how to set this up so that the loose page goes into its own folder and each multi-page invoice stays together in its own folder.

Thank you.

Answers

  • dgreenwood Posts: 10 admin
    Instead of using Multi-Separator would it work to have two Separate steps?  Set the first one to the Change in Value separator and set the "Miss Disposition" property to "None".  Then have a second Separate step using the Page Count Event-Based separator to pick up the loose pages and put them in a folder.  Would that solve the issue?
  • DungVu Posts: 51
    Thank you for your suggestion. However, we now have a two-page invoice whose second page is a supporting page with no invoice number. It should go with the first page in the same document folder as a supporting document. Another issue is with document #8: its second page is missing because the batch was spawned to the exception queue process due to flagged items, and the second page of document #8 was left behind in the main process. Could you point us in some direction for handling this scenario?
    The screenshot below shows more details.



  • dgreenwood Posts: 10 admin
    Let's go back to your original issue and look at where your separation points should be.  So you have a series of invoices and you're trying to separate off of a changing invoice number.

    So here's something like how it should be working.

    ---New Document---
    Page 1 - invoice number:01
    Page 2 - invoice number:01
    ---New Document---
    Page 3 - invoice number:02
    ---New Document---
    Page 4 - invoice number:03
    Page 5 - invoice number:03
    Page 6 - invoice number:03
    ---New Document---
    Page 7 - invoice number:04

    But it's doing something like this?

    ---New Document---
    Page 1 - invoice number:01
    Page 2 - invoice number:01
    Page 3 - invoice number:02
    ---New Document---
    Page 4 - invoice number:03
    Page 5 - invoice number:03
    Page 6 - invoice number:03
    Page 7 - invoice number:04
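
For anyone following along, the two layouts above can be reproduced with a small Python sketch of change-in-value separation. This is only a simplified model of the idea, not the product's actual implementation: the function name `separate_by_value_change` is hypothetical, and it assumes that a page whose invoice number was not extracted (`None`) is simply appended to the current document.

```python
from typing import List, Optional


def separate_by_value_change(values: List[Optional[str]]) -> List[List[int]]:
    """Group 1-based page numbers into documents.

    A new document starts whenever the extracted value differs from the
    last value that was successfully extracted. A page with no extracted
    value (None) is assumed to stay with the current document.
    """
    documents: List[List[int]] = []
    last_value: Optional[str] = None
    for page_number, value in enumerate(values, start=1):
        starts_new = value is not None and value != last_value
        # The very first page always opens a document, even on a miss.
        if starts_new or not documents:
            documents.append([])
        documents[-1].append(page_number)
        if value is not None:
            last_value = value
    return documents
```

Calling it with `["01", "01", "02", "03", "03", "03", "04"]` gives the first layout (four documents). Replacing the values for pages 3 and 7 with `None`, to mimic missed extractions, gives the second layout, where those pages are absorbed into the preceding document.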
  • tgarnett Posts: 52 ✭✭
    If I'm understanding the issue, I don't think a PageCount separator would help here. That will just take every loose page and put it in its own folder, which is not what you want since some Invoices have multiple pages.

    If some pages aren't being detected by your separation extractor, you'll want to add another Data Type that will find something unique to the page you want it to separate on. You'd just need to see more examples of that document to see how consistent they are. The Facsimile example page doesn't have a label to anchor the Invoice Number off of, so it would need to be pretty specific to avoid grabbing other numbers.
  • DungVu Posts: 51
    In some cases we don't have a unique value to look for. In the screenshot below, the second page states PAGE 2 of 2 and the first page contains PAGE 1 of 2. Can we use EPI Extractor to separate these invoices?

  • tgarnett Posts: 52 ✭✭
    That page 2 of 2 was only split into a folder because of the "PageCount 1" separator, correct? I would get rid of that separation method entirely and switch back to your original one.

    The key is to identify anything on the page that tells you that you're looking at page 1 of a document. You already have the "Change in Value" method for Invoice number, creating a new folder whenever a new invoice number appears. You could add a pattern-based separation extractor that finds and splits on any instance of the phrase "PAGE 1 of". You can then add new patterns to this extractor whenever new formats come in that fail to separate.
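
To make the combined approach concrete, here is a rough Python sketch of the logic: start a new folder when the invoice number changes or when the page text contains a first-page marker such as "PAGE 1 of". It is an illustration only, not how the product implements separation. The names `FIRST_PAGE_PATTERN` and `separate_pages` are hypothetical, the regular expression is just one possible pattern, and each page is assumed to be an (invoice number or `None`, page text) pair.

```python
import re
from typing import List, Optional, Tuple

# Assumed pattern: matches "PAGE 1 of 2", "Page 1 of 10", etc.
# The whitespace required after "1" keeps "PAGE 11 of 12" from matching.
FIRST_PAGE_PATTERN = re.compile(r"\bPAGE\s+1\s+of\s+\d+\b", re.IGNORECASE)


def separate_pages(pages: List[Tuple[Optional[str], str]]) -> List[List[int]]:
    """Group 1-based page numbers into documents.

    A new document starts when the invoice number changes or when the
    page text contains a first-page marker. Pages with neither signal
    stay with the current document.
    """
    documents: List[List[int]] = []
    last_value: Optional[str] = None
    for page_number, (invoice_number, text) in enumerate(pages, start=1):
        value_changed = invoice_number is not None and invoice_number != last_value
        first_page_marker = FIRST_PAGE_PATTERN.search(text) is not None
        if value_changed or first_page_marker or not documents:
            documents.append([])
        documents[-1].append(page_number)
        if invoice_number is not None:
            last_value = invoice_number
    return documents
```

With this model, a supporting page that reads "PAGE 2 of 2" and has no invoice number stays with the page before it, while a page that reads "PAGE 1 of 1" starts its own folder even when its invoice number was missed. New formats that fail to separate would be handled by adding more alternatives to the pattern.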