Need a GUI application to read an input file with duplicate records and generates an output file without the duplicate records.
Please see this site for all the info you will need:
<[login to view URL]>
If you have experience with perl and c++ and the use of spiders in collecting data from different web sites please let me know in your reply, I may have a project in a month or two.
----------------------------------------
Need a GUI application to read an input file with duplicate records and generates an output file without the duplicate records.
The GUI should ask the user to enter the location and name of the input and output files on the local drive. Both files are in html format.
Here are some sample files with duplicate records:
<[login to view URL]>
<[login to view URL]>
<[login to view URL]>
<[login to view URL]>
<[login to view URL]>
Every record represents a game played and the final results. For example, the first record in the [login to view URL] file is a game played between Orlando and Detroit, the final score was 93-108, its followed by three duplicate records I dont need.
Every time I run the program and enter an output I want the records generated by the program to be APPENDED to the end of the output file.
## Deliverables
**Two important notes,
**1- If there are duplicate records they appear CONSECUTIVELY. lets look at: <[login to view URL]>
The first records is:
| 05/04 | 519 | Orlando | | 93 | 181 | 181½ |
| 9:35am | 520 | Detroit | | 108 | 6 | 6 |
As you can see it appears four STRAIGHT times.
2- The output file should contain simple html tags. For example if when you look at [login to view URL] after the duplicates for the first three records are deleted the output file should look like this:
| 05/04 | 519 | Orlando | | 93 | 181 | 181½ |
| 9:35am | 520 | Detroit | | 108 | 6 | 6 |
| 05/04 | 521 | Portland | | 95 | 198½ | 197 |
| 12:35am | 522 | Dallas | | 107 | 7 | 6 |
| 05/05 | 701 | Boston | | 93 | 181½ | 182 |
| 4:05pm | 702 | New Jersey | | 97 | 7 | 7½ |
Simple source code:
<html>
<head>
<title>Output File</title>
</head>
<body>
<table>
<tr>
<td>05/04</td>
<td>519</td>
<td>Orlando</td>
<td> </td>
<td>93</td>
<td>181</td>
<td>181½</td>
</tr>
<tr>
<td>9:35am</td>
<td>520</td>
<td>Detroit</td>
<td> </td>
<td>108</td>
<td>6</td>
<td>6</td>
</tr>
<tr>
<td>05/04</td>
<td>521</td>
<td>Portland</td>
<td> </td>
<td>95</td>
<td>198½</td>
<td>197</td>
</tr>
<tr>
<td>12:35am</td>
<td>522</td>
<td>Dallas</td>
<td> </td>
<td>107</td>
<td>7</td>
<td>6</td>
</tr>
<tr>
<td>05/05</td>
<td>701</td>
<td>Boston</td>
<td> </td>
<td>93</td>
<td>181½</td>
<td>182</td>
</tr>
## Platform
<tr>
<td>4:05pm</td>
<td>702</td>
<td>New Jersey</td>
<td> </td>
<td>97</td>
<td>7</td>
<td>7½</td>
</tr>
</table>
</body>
</html>