Document Scanner
$30-100 USD
Dibayar saat pengiriman
We receive a large number of files a day that contain various content like (EDIFACT-, ANSI X.12-, XML-, Comma Separated- or fixed length files). Due to this we are looking for a java program that is capable of recognizing these various documents.
## Deliverables
Examples of how the program can match the DocumentType of a file are: For XML a file can be recognized based on the XML root and/or a element or value in a element/attribute. For a fixed length file one needs to be able to match a specific line with a ID and/or match a specific portion of a line. For a comma separated file one needs to be able to match based on one a specific field and/or on a specific line. Basically the configuration should be very flexible so that any received can be recognized. Once recognised is needs to retrieve a Sender and Received ID from the file. Were these ID’s can be found in the file should work the same way as how a document is recognized. At the end if a file has successfully gather the information it should write out this information to a xml file which would look something like this. Name of the input filename Path of the input file Size of the inputfile in bytes Can either be EDIFACT, ANSIX12, TRADACOMS, CSV OR FLF</ FileContent> Date time of processing SenderID derived from the input file ReceiverID derived from the input file Type of document that program found in Database The documentID derived from the database Due to the number of files received the tool should be build with performance in mind. I know a lot about how to recognize files the the file types but my Java knowledge is pretty poor so I can’t program it myself. Finally the the source code should be well documented. We prefer the java code to be written so that it can be reused. A interface/package type application has our preference.
* * *This broadcast message was sent to all bidders on Tuesday Dec 30, 2008 2:28:03 PM:
Hi, Please hold your bids. Later today or at the latest tomorrow i will add some documentation for this p9roject together with some sample files. However the basic functionality of the program to recognize data files remains the same. Most data files except EDIFACT, ANSI [url removed, login to view] Tradacoms, have all the required info in them. Fixed Length files usually work with a unique RecordID at the beginning of the line and hold the information to recognize the data. Comma Separated Value (CSV) files can also have a unique recordid at the beginning of a file or a certain value on one of the lines to recognize the document and of course sender/receiver. Meanwhile there is enough documentation on EDIFACT, ANSI [url removed, login to view], Tradacoms on the internet together with examples on how these files look. I ask you all to go ahead and think about a solution for this project. Thanks in advance in showing your interest. Marco
* * *This broadcast message was sent to all bidders on Wednesday Dec 31, 2008 8:11:45 AM:
To all, A zip file containing examples, specifications and File definitions have been added to the bid request. Please look carefully through it. And let me know your bid and if you are able to do it. Please bare in mind that this needs to be a generic program that has its own mechanism to identify the type of document and retrieve the information from it. The configuration for how to identify a file needs to come from a MySQL (at first) database. You are responsible for creating this database. To configure to program for it to recognize a document needs to be a simple process. Warm regards. Marco
* * *This broadcast message was sent to all bidders on Wednesday Dec 31, 2008 10:46:05 AM:
All, Thank you all for your enthusiasm and your bids. At this point it is hard to choose the right coder because none has given a insight on how the program will work in recognizing the document and what the user needs to fill in for the program to start recognizing a specific type of document. Please let me know your ideas and based on that i will choose the coder that i feel most confident in making this program. Thanks and i hope to hear from you soon.
ID Proyek: #3504972