Google Scholar Web Scrape

$30-50 USD

Completed
Posted more than 2 years ago

Paid on delivery
I need to scrape search results from Google Scholar without getting blocked. There is a similar Python-based tutorial here [login to view URL], so this should be a quick and easy project.
This project has two parts. The first part is collecting historical data from 2017-2021 for about 45 different searches. Here is an example search: [login to view URL]
I will provide a txt file listing the exact search parameters, one search per line, similar to:
"weight loss" OR "obesity"
"Lymphoma" OR "Lymph" OR "Lymphatic"
"Eye" OR "Cataract" OR "Retina" OR "Glaucoma" OR "myopia" OR "hyperopia"
Whatever solution you create should let me add or modify searches, either in the text file or directly in the code.
IMPORTANT: The searches must be filtered to search only the abstracts. The default is to search the entire document.
Because the first-part searches are historical (2017-2021), each will return over 10,000 results. I need to get around the limit of 1,000 results per search. I could filter by year, but the results will probably still exceed 1,000 for any given year, so you need to find a solution that captures all historical data. See the attached document for further explanation.
For the historical searches I need to capture the total result count listed near the top of the page, as well as the "Vancouver Citation" data from a link under each result (within the citation JavaScript link).
The data must be saved to a MySQL table, which the second part of the project will append to on a weekly or monthly basis. Again: the first part collects the historical data and saves it in a MySQL table, and the second part appends new results to that table.
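As a rough illustration of the query file and the year-by-year split described above, here is a minimal Python sketch. It only builds request URLs and fetches nothing. The endpoint in `BASE_URL` is an assumption, since the posting hides its links, and the helper names `load_queries` and `build_urls` are hypothetical. The sketch also assumes the `as_ylo`/`as_yhi` year bounds and the `start` paging offset behave as they do in ordinary Scholar result URLs.

```python
from urllib.parse import urlencode

# Assumption: the public Scholar search endpoint; the posting elides its URLs.
BASE_URL = "https://scholar.google.com/scholar"


def load_queries(text):
    """Return one search expression per non-empty line of the query file."""
    return [line.strip() for line in text.splitlines() if line.strip()]


def build_urls(query, first_year, last_year, page_size=10, cap=1000):
    """Yield one URL per results page, restricted to a single year at a time."""
    for year in range(first_year, last_year + 1):
        # Page through each year up to the 1,000-result cap named in the posting.
        for start in range(0, cap, page_size):
            params = {"q": query, "as_ylo": year, "as_yhi": year, "start": start}
            yield f"{BASE_URL}?{urlencode(params)}"


# Stand-in for the contents of the client's txt file (hypothetical sample).
sample_file = '"weight loss" OR "obesity"\n\n"Lymphoma" OR "Lymph" OR "Lymphatic"\n'
queries = load_queries(sample_file)
urls = list(build_urls(queries[0], 2017, 2021))
print(len(queries), "queries,", len(urls), "page URLs for the first query")
print(urls[0])
```

This does not address the abstract-only requirement, and a single year that exceeds 1,000 results would still need a further split that the posting leaves open.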
For the second part: when the Google Scholar results page is sorted by date, each result carries an "XX days ago - " prefix (see [login to view URL],47&as_vis=1&q=%22weight+loss%22+OR+%22obesity%22&scisbd=1). The script simply needs to grab only the results that meet the date criteria. For example, if we plan to run the script once a week, it should scrape only results that include one of the following texts: "1 day ago", "2 days ago", "3 days ago", "4 days ago", "5 days ago", "6 days ago", or "7 days ago". The attached PDF provides further explanation of the needed extraction.
The final code should capture the data and store it in a predefined MySQL database table. The code must be delivered as either a fully functioning .py file or .php file, which will be run weekly as a cron job.
You will need to work from your own server environment. When the project is complete, you can send me a video showing the functioning script and sample output. Be sure to test the script by running many searches, because it needs to avoid being blocked by Google. You can incorporate a proxy if needed and I will purchase accordingly (the proxy cost should not be more than $5-$10/month).
I will create 2 milestones. The first milestone will be 50% of the total and will be released upon confirmation of the functioning script (video and sample output). The final milestone will be released after the script has been delivered and is functioning on my server. When delivering the script, you can leave the server section with dummy data. I will modify the script with the server details for storing the collected data.
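The date criterion for the weekly run ("1 day ago" through "7 days ago") could be checked along these lines. This is a sketch only: it assumes each result snippet starts with the "N days ago - " text quoted in the description, and the names `age_in_days` and `is_recent` are hypothetical.

```python
import re

# Matches a leading "1 day ago" or "N days ago" marker at the start of a snippet.
AGE_PATTERN = re.compile(r"^\s*(\d+)\s+days?\s+ago\b")


def age_in_days(snippet):
    """Return the age stated at the start of a snippet, or None if absent."""
    match = AGE_PATTERN.match(snippet)
    return int(match.group(1)) if match else None


def is_recent(snippet, max_days=7):
    """True when the snippet is dated between 1 and max_days days ago."""
    age = age_in_days(snippet)
    return age is not None and 1 <= age <= max_days


# Hypothetical snippets in the shape the description quotes.
snippets = [
    "1 day ago - Obesity is associated with ...",
    "7 days ago - We study weight loss in ...",
    "12 days ago - A cohort of patients ...",
    "Weight loss outcomes were measured ...",
]
recent = [s for s in snippets if is_recent(s)]
print(len(recent), "of", len(snippets), "snippets fall inside the weekly window")
```

A monthly run would pass a larger `max_days`, provided Scholar still labels results that old in the same way, which is not confirmed here.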
Project ID: 31873729

About the project

9 proposals
Remote project
Active 2 years ago

Awarded to:
Hey, I read what you need. I have scraped Google Scholar before for a Freelancer project (I can give you a reference to that project as well). I am interested in discussing the details in chat.
$100 USD in 2 days
4.9 (222 reviews)
6.7

9 freelancers are bidding an average of $115 USD for this job

Hey! I am a skilled Python software engineer. I am familiar with Python and have a lot of work experience in Python, PHP, Data Mining and Web Scraping. I can start right away. I would like to discuss this project in detail. Please send a message to discuss it further. Thanks for giving me the opportunity.
$150 USD in 5 days
5.0 (30 reviews)
5.8

I am a Data Scientist with Machine Learning expertise. Please take a look at my profile and reviews for references.
$40 USD in 7 days
5.0 (9 reviews)
5.2

Hello, I hope this finds you well. I have just seen your project requiring PHP, Python, Web Scraping and Data Mining. I believe that my 10 years of experience in this field is what you need right away. Avoid the headache of looking further. Let's save time and focus on the real task. My proposed timeline and budget are just placeholders and are open to negotiation, up or down, according to the full requirements. Allow me to prove how much better my reviews can be. Smile all the way! Click the message button to start the conversation. Regards, Fridah
$400 USD in 7 days
3.6 (17 reviews)
5.7

Hi Sir! I am a professional Python software engineer. I have a lot of work experience in Python and website scraping, and I have done many projects scraping websites and video. I can start right now. Please contact me to discuss this project further. Best regards, Nguyen Tuan
$40 USD in 7 days
5.0 (12 reviews)
3.5

About the client

Aldie, United States
5.0
39
Payment method verified
Member since Feb 14, 2011

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)