crawl data from http://seccionamarilla.com.mx (repost)
$100-500 USD
Completed
Posted about 14 years ago
Paid on delivery
[login to view URL] is a yellow pages directory service. The usability of the website sucks. I want the data extracted and stored in a local database. I want a script that fetches all the information and stores it, so that I can re-run the script later and fetch the data again.
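One way to make such a script safe to re-run is to key every record on an identifier taken from the source site and overwrite the row when it is seen again. The sketch below is only an illustration under assumptions: it uses Python and the standard-library `sqlite3` module as a stand-in for the preferred PHP 5 and MySQL, and the `listings` table, its columns, and the `source_id` key are hypothetical names. In MySQL the same effect is usually obtained with `INSERT ... ON DUPLICATE KEY UPDATE`.

```python
import sqlite3


def open_store(path=":memory:"):
    """Open the local database and create the (hypothetical) listings table."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS listings ("
        " source_id TEXT PRIMARY KEY,"  # assumed stable id from the source site
        " name TEXT NOT NULL,"
        " state TEXT)"
    )
    return conn


def save_listing(conn, source_id, name, state):
    """Insert a listing, or overwrite the stored row if source_id already exists."""
    conn.execute(
        "INSERT OR REPLACE INTO listings (source_id, name, state) VALUES (?, ?, ?)",
        (source_id, name, state),
    )
    conn.commit()


def count_listings(conn):
    """Return how many distinct listings are stored."""
    return conn.execute("SELECT COUNT(*) FROM listings").fetchone()[0]
```

With this shape, running the crawl a second time refreshes existing rows instead of duplicating them.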
The data to collect includes categories, geoposition (longitude, latitude), names, state, description, images, logo, graphical ads, etc. All of this is public information that they release to everyone, so it should not be complicated to crawl.
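The fields listed above could be gathered into one record per company before being written to the database. This is a minimal sketch in Python, used here for illustration only (the posting prefers PHP 5); the class name `Listing` and every attribute name are assumptions, not names taken from the site.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Listing:
    """One company entry, holding the fields named in the posting."""
    name: str
    state: str
    categories: List[str] = field(default_factory=list)
    longitude: Optional[float] = None
    latitude: Optional[float] = None
    description: str = ""
    image_urls: List[str] = field(default_factory=list)
    logo_url: Optional[str] = None
    ad_urls: List[str] = field(default_factory=list)  # graphical ads

    def has_valid_position(self) -> bool:
        """True when both coordinates are present and within valid ranges."""
        if self.longitude is None or self.latitude is None:
            return False
        return -180.0 <= self.longitude <= 180.0 and -90.0 <= self.latitude <= 90.0
```

Keeping the coordinates optional allows entries without a geoposition to be stored rather than dropped.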
I would prefer the script to be coded in PHP 5 with the database in MySQL. Bids that meet these technical requirements will be given more weight, but they are not a strict requirement. The script will run on a Linux server, and any third-party libraries are accepted (Zend Framework, PEAR, ezComponents, etc.).
The goal is for the script to fetch all the data.
## Deliverables
If you type "hotel" in the first textbox and "Queretaro" in the second, you will get all hotels in the state of Queretaro (a state in Mexico). That is a search result.
From my understanding, the DB is designed like this:
states have cities, cities have categories, and each category has its different companies.
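The hierarchy described above could map onto four tables linked by foreign keys. The sketch below is an assumption about how a local copy might be laid out, not the site's actual schema; it uses Python's standard-library `sqlite3` in place of MySQL for illustration, and all table and column names are hypothetical.

```python
import sqlite3

# Hypothetical local schema: states -> cities -> categories -> companies.
SCHEMA = """
CREATE TABLE states (
    id   TEXT PRIMARY KEY,          -- e.g. a short state code
    name TEXT NOT NULL
);
CREATE TABLE cities (
    id       INTEGER PRIMARY KEY,
    state_id TEXT NOT NULL REFERENCES states(id),
    name     TEXT NOT NULL,
    UNIQUE (state_id, name)
);
CREATE TABLE categories (
    id      INTEGER PRIMARY KEY,
    city_id INTEGER NOT NULL REFERENCES cities(id),
    name    TEXT NOT NULL,
    UNIQUE (city_id, name)
);
CREATE TABLE companies (
    id          INTEGER PRIMARY KEY,
    category_id INTEGER NOT NULL REFERENCES categories(id),
    name        TEXT NOT NULL
);
"""


def create_schema(conn):
    """Create the four tables on an open sqlite3 connection."""
    conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves this off by default
    conn.executescript(SCHEMA)
```

The `UNIQUE` constraints keep a repeated crawl from inserting the same city or category twice under the same parent.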
"guia de ciudades" (city guide) is a list of cities within the different states of Mexico; that is probably the best place to start.
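Starting from the city list, the crawl order suggested above could be sketched as a nested walk from states down to companies. This is a sketch under assumptions: it is written in Python for illustration, and the four `list_*` callables are hypothetical placeholders for whatever page-fetching and parsing code the real script would use.

```python
from typing import Callable, Iterable, Iterator, Tuple


def walk_directory(
    list_states: Callable[[], Iterable[str]],
    list_cities: Callable[[str], Iterable[str]],
    list_categories: Callable[[str, str], Iterable[str]],
    list_companies: Callable[[str, str, str], Iterable[str]],
) -> Iterator[Tuple[str, str, str, str]]:
    """Yield (state, city, category, company) by walking the hierarchy top down."""
    for state in list_states():
        for city in list_cities(state):
            for category in list_categories(state, city):
                for company in list_companies(state, city, category):
                    yield state, city, category, company
```

Passing the listing functions in as arguments keeps the traversal independent of how pages are downloaded, so it can be exercised with in-memory data before touching the live site.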
* * *
This broadcast message was sent to all bidders on Tuesday Jan 19, 2010 3:18:00 PM:
If you type "hotel" in the first textbox and "Queretaro" in the second, you will get all hotels in the state of Queretaro (a state in Mexico). That is a search result. From my understanding, the DB is designed like this: states have cities, cities have categories, and each category has its different companies. "guia de ciudades" (city guide) is a list of cities within the different states of Mexico; that is probably the best place to start.
* * *
This broadcast message was sent to all bidders on Tuesday Jan 19, 2010 5:31:20 PM:
I don't know how many categories there are; collect as many as you can find. That's why I posted that, from my understanding, their current data is stored in the following structure: states -> cities -> categories.

For example, [login to view URL] shows me the categories for the city "Cancún" in the state "Quintana Roo" (state id = QR). There it shows the following categories and subcategories:

Category: Entretenimiento (entertainment)
Subcategories:
* Balnearios
* Balnearios
* Discotecas-Salas para Bailar
* Academias de Actuación
* Academias de Alta Cocina y Repostería

Category: Restaurantes (restaurants)
Subcategories:
* Restaurantes-Cocina Internacional
* Restaurantes Comida para Llevar y a Domicilio
* Pastelerías
* Pizzas-Elaboración de

etc.
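The Cancún example could be held as a mapping from category to subcategories and then flattened into rows ready for storage. This is a minimal Python sketch for illustration; the names `CANCUN_CATEGORIES` and `flatten_categories` are hypothetical, the sample values are the ones quoted in the message, and skipping repeated entries (such as the second "Balnearios") is an assumption about the desired behavior.

```python
from typing import Dict, Iterator, List, Tuple

# Categories and subcategories as quoted in the message for Cancún (state id QR).
CANCUN_CATEGORIES: Dict[str, List[str]] = {
    "Entretenimiento": [
        "Balnearios",
        "Balnearios",
        "Discotecas-Salas para Bailar",
        "Academias de Actuación",
        "Academias de Alta Cocina y Repostería",
    ],
    "Restaurantes": [
        "Restaurantes-Cocina Internacional",
        "Restaurantes Comida para Llevar y a Domicilio",
        "Pastelerías",
        "Pizzas-Elaboración de",
    ],
}


def flatten_categories(
    state_id: str, city: str, tree: Dict[str, List[str]]
) -> Iterator[Tuple[str, str, str, str]]:
    """Yield unique (state_id, city, category, subcategory) rows in listing order."""
    seen = set()
    for category, subcategories in tree.items():
        for subcategory in subcategories:
            row = (state_id, city, category, subcategory)
            if row in seen:
                continue  # skip entries the page lists more than once
            seen.add(row)
            yield row
```

Calling `list(flatten_categories("QR", "Cancún", CANCUN_CATEGORIES))` produces one row per distinct subcategory, which maps directly onto a categories table.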