
Wikipedia data dump miner

$15-25 USD / hour

Closed
Posted over 6 years ago

I'm looking for a Wikipedia and machine learning expert.
- Are you an expert in Wikipedia dump files?
- Do you love to write scripts that automate extractions?
- Which scripting languages do you already know? Python, Bash?
- You will work closely with our teams building user experiences and collaborative machine learning algorithms.

What do you think of this first task?
1. Given two languages, say en and zh,
2. a page category, like Living people,
3. and a specified WP dump date,
4. generate a set of sets of name strings, where each set has all of the en redirects and zh redirects for a given pair of en-zh linked titles.

For example, the set for Vladimir Putin's page would have all his redirects in English as well as his page name in Chinese and all of its redirects.

If you like that as a starting task, please give me an hour estimate for it and we can start a contract with that as the first task. Going forward, we have a bunch of tasks of this kind.
Project ID: 14845502
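The first task could be sketched roughly as follows. This is only a minimal illustration, not the client's method: it assumes the category membership, the en-to-zh language links, and the redirect tables have already been parsed into plain Python dictionaries (for example from the `page`, `redirect`, `langlinks`, and `categorylinks` SQL files of a dated dump). The function names and the toy data are hypothetical and are not taken from a real dump.

```python
from collections import defaultdict


def invert_redirects(redirects):
    """Map each target title to the set of titles that redirect to it."""
    by_target = defaultdict(set)
    for source, target in redirects.items():
        by_target[target].add(source)
    return by_target


def build_name_sets(category_members, langlinks, src_redirects, dst_redirects):
    """Return one frozenset of names per linked page pair.

    category_members: titles in the chosen category (source language)
    langlinks:        source-language title -> linked title in the other language
    src_redirects:    redirect title -> target title (source language)
    dst_redirects:    redirect title -> target title (other language)
    """
    src_by_target = invert_redirects(src_redirects)
    dst_by_target = invert_redirects(dst_redirects)
    name_sets = []
    for src_title in sorted(category_members):
        dst_title = langlinks.get(src_title)
        if dst_title is None:
            continue  # no linked page in the other language, so no pair
        names = {src_title, dst_title}
        names |= src_by_target.get(src_title, set())
        names |= dst_by_target.get(dst_title, set())
        name_sets.append(frozenset(names))
    return name_sets


# Toy data for illustration only (assumed, not read from a dump).
living_people = {"Vladimir Putin", "Jane Example"}
en_to_zh = {"Vladimir Putin": "弗拉基米尔·普京"}
en_redirects = {
    "Putin": "Vladimir Putin",
    "Vladimir Vladimirovich Putin": "Vladimir Putin",
    "Some Other Redirect": "Some Other Page",
}
zh_redirects = {"普京": "弗拉基米尔·普京", "普丁": "弗拉基米尔·普京"}

name_sets = build_name_sets(living_people, en_to_zh, en_redirects, zh_redirects)
for names in name_sets:
    print(sorted(names))
```

Inverting each redirect table once keeps the per-page lookup cheap, which matters because a category such as Living people contains a very large number of pages. Pages without a linked title in the second language are skipped here; whether they should instead yield a single-language set is a design decision the posting leaves open.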

About the project

10 proposals
Remote project
Active 7 years ago

10 freelancers bid an average of $21 USD/hour for this job
I can write a script using Python's requests library that will generate a set of sets of name strings, based on your specified criteria. The requests library is really powerful and offers features such as persistent sessions (for fast querying). I can complete the initial task in 6 hours. If you are interested, we can talk via chat and I can tell you more about my previous (similar) work!
$25 USD in 20 days
5.0 (8 reviews)
4.5
I would like to offer you my expertise, as I have done a number of academic projects and I am a professional in the field. Contact me and I'll show you what I am capable of.
$22 USD in 40 days
4.0 (18 reviews)
3.1
I'm CTO at datascraping [dot] club. We provide data scraping and website scraping services and have a lot of experience with machine learning and data scraping in general. Would love to chat about your project and share my experience. Thanks
$22 USD in 40 days
5.0 (1 review)
2.2
Hello, my name is Michael. I represent Webbook Inc, a Ukraine-based IT company that provides IT services for international business. We have carefully reviewed the requirements in the job description, so our devs can work on your project without delay. We have years of experience on projects built on any available CMS or from scratch with core PHP and PHP frameworks (Yii/Yii2, Laravel, CodeIgniter), JavaScript, jQuery, AJAX, HTML5, CSS3, Bootstrap, JavaScript frameworks, 3D design, graphic design, etc. However, I would like to discuss the requirements and functionality in detail to get a better understanding of the time frame and price. We would be glad to chat with you and discuss everything in detail. Contact us and we will reply immediately. Waiting for your reply! Best regards, Webbook team
$22 USD in 40 days
0.0 (0 reviews)
0.0
I have hands-on expertise in Python (Beautiful Soup) web crawling. I am also a data engineer whose day job involves creating data pipelines for extraction and transformation.
$27 USD in 30 days
0.0 (0 reviews)
0.0
I have been editing Wikipedia for more than 3 years, and I have used Pywikibot with my own scripts. Besides that, I love everything related to Wikipedia and I will do this job with love :).
$15 USD in 40 days
0.0 (0 reviews)
0.0

About the client

Beijing, China
5.0
1
Member since Apr 7, 2017
