Find Jobs
Hire Freelancers

Building a scalable web scraper for a large number of different websites

€6-12 EUR / hour

Ditutup
Dibuat lebih dari 3 tahun yang lalu

€6-12 EUR / hour

The goal of the project is to build a scalable web scraper which should scrape data from more a dozen different websites at first. Later on, it should be possible to upscale the scraper to a few thousand websites. Those websites are known and should be added iteratively to the scraper. The websites have a different structure each which is why the development and maintenance costs per site need to stay as small as possible. The aim is to scrape the websites on a weekly basis at first. Later on, the scraping intervals should be reduced to a daily basis or even shorter. The scraped data needs to be stored in an useful and efficient way in a database in the cloud. Furthermore, the scraping must be intolerant to changes in the designs of the websites and it must prevent being blocked. Currently, a simple scraper in Python exists which can scrape a few websites by using the Selenium library. However, this does not need to be continued at all cost. The following tasks are part of your engagement for the project: o Developing a modular and scalable software architecture for the web scraping project (preferably with Python) o Containerizing the program in Docker o Deploying and managing the containers in the cloud, probably with AWS and Kafka o Implementing different measures to prevent blacklisting and being blocked o Setting up a SQL database, probably PostgreSQL with AWS The following tasks might be part of a further engagement: o Implementing the web scrapers for a large number of different websites o Maintaining and monitoring the scrapers for the websites o Adding a web crawler to find additional websites o Parsing the stored data and processing them into a more useful format Your qualifications: o Web Scraping (Importance: 9/10) o Python (Importance: 7/10) o Docker (Importance: 8/10) o AWS (Importance: 5/10) o Kafka or other Pipelining/Queuing Tools (Importance: 8/10) o Cloud Databases (Importance: 6/10) o PostgreSQL (Importance: 10/10) You are expected to work closely together with our developer in Germany. The tasks above need to be coordinated and done in cooperation with him. Therefore, a willingness to work between 10 AM and 10 PM Central European Time is required. We wish to get to know you first by working together in a limited project scope. If you are a fit for our team, we are willing to intensify our cooperation with you and hire you for future projects.
ID Proyek: 28930972

Tentang proyek

8 proposal
Proyek remot
Aktif 3 tahun yang lalu

Ingin menghasilkan uang?

Keuntungan menawar di Freelancer

Tentukan anggaran dan garis waktu Anda
Dapatkan bayaran atas pekerjaan Anda
Uraikan proposal Anda
Gratis mendaftar dan menawar pekerjaan
8 freelancer menawar dengan rata-rata €10 EUR/jam untuk pekerjaan ini
Avatar Pengguna
we are using python in scraping Please, contact me and send me the link to the site so I could make a FREE SAMPLE Please, contact me and send me the link to the site so I could make a FREE SAMPLE Hi there, I’ve read your brief, We are a large team of developers and I’m pretty confident that I will be able to scrape any site you prefer automatically, extract its data in Excel Format (.xlsx, .csv) and send a FREE SAMPLE before we start. We scraped many sites before like Amazon, eBay, Yellowpages, Yell, HOUZZ, Jumia, Realtor, ....etc. We are a large team who can help you through the Manual work, please contact us. We have a list of satisfied customers, please take a look at our Portfolio. We'll provide you with the work as per your requirement with effective communication, attention to details and within a particular time frame. let’s start and finish it fast, We are looking forward to discussing the project with you.
€8 EUR dalam 40 hari
5,0 (100 ulasan)
6,7
6,7
Avatar Pengguna
Hello there. I am very interested in your project. *** As web scraping and python expert ***. I can handle this and am confident of winning. So I have rich experience in scraping app development with python , selenium , scraper, beautiful and if you give a chance, perfect result for you will be wait. I guarantee you to will timing and quality… I want to work for you fulltime as per your timeline and can start to work immediately. Thank you
€10 EUR dalam 40 hari
4,9 (7 ulasan)
4,9
4,9
Avatar Pengguna
Hello, This is Amine from Malaysia, a full stack web developer, who has working 5 years of working experiences in this field. I am fully feeling comfortable working with Python, web Scraping, AWS, PostgreSQL.. I will be detail oriented and dedicated. Looking forward to hear from you. Regards. Amine
€10 EUR dalam 40 hari
5,0 (4 ulasan)
3,8
3,8
Avatar Pengguna
Hello. An experienced web extractor doing projects mainly in PHP but Python might also be an option. Thanks for considering Eugene
€15 EUR dalam 40 hari
5,0 (6 ulasan)
4,0
4,0
Avatar Pengguna
Hi Sir Nice to meet you i am expert in python with web scraping at high level. I agree with your time zone confidential level of skiils you wrote above. Plase come in chat and show me details
€12 EUR dalam 40 hari
5,0 (1 ulasan)
4,1
4,1
Avatar Pengguna
Hi, there. Here is an expert web scraping and automation developer who is very familiar with python/Selenium. After checking your job description and skill set, I found this job suits me as well. I can work in the timezone you want and start working right away. Please feel free to discuss the details via private chat. Regards. Stefan.
€12 EUR dalam 40 hari
5,0 (4 ulasan)
3,4
3,4
Avatar Pengguna
This project really caught my eyes. I have the required qualification to do this work. I will be working with python using scrapy framework. There are really javascript heavy website nowadays which really makes it difficult to work with other libraries like selenium and beautiful soup are not reliable in this case. Its again looks like an long project so will be needing your cooperation in all ascepts to complete the project which we can have detailed chat before taking it up. I am flexible with the timing and its not an issue with me. As discussed we can have detailed discussion preferably over call and then we can work together. Looking forward to work with you Thank you, JITHIN JOSEPH
€8 EUR dalam 40 hari
5,0 (15 ulasan)
3,1
3,1
Avatar Pengguna
I have strong experiance on below, please give chance to work on this project. qualifications: o Web Scraping (Importance: 9/10) o Python (Importance: 7/10) o Docker (Importance: 8/10) o AWS (Importance: 5/10) o Kafka or other Pipelining/Queuing Tools (Importance: 8/10) o Cloud Databases (Importance: 6/10) o PostgreSQL (Importance: 10/10)
€6 EUR dalam 40 hari
0,0 (0 ulasan)
0,0
0,0

Tentang klien

Bendera GERMANY
Steinweiler, Germany
0,0
0
Memverifikasi Metode pembayaran
Anggota sejak Feb 18, 2018

Verifikasi Klien

Terima kasih! Kami telah mengirim Anda email untuk mengklaim kredit gratis Anda.
Anda sesuatu yang salah saat mengirimkan Anda email. Silakan coba lagi.
Pengguna Terdaftar Total Pekerjaan Terpasang
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Memuat pratinjau
Izin diberikan untuk Geolokasi.
Sesi login Anda telah kedaluwarsa dan Anda sudah keluar. Silakan login kembali.