Find Jobs
Hire Freelancers

Convert PDF and Microsoft Word document to JSON using python 2.6

$250-750 USD

Ditutup
Dibuat sekitar 8 tahun yang lalu

$250-750 USD

Dibayar ketika dikirim
I have several hundred resumes in PDF and Microsoft Word format. I want a python script that is capable of extracting the data from these file formats and generating a JSON document containing the resume content. The JSON that is generated should adhere to the standards defined here: [login to view URL] I have included 2 sample files, but will provide a much larger set of test resumes to be used in the development effort. The script should include automated testing (check the resulting JSON matches the manually created JSON document for the corresponding PDF/Word document).
ID Proyek: 10289742

Tentang proyek

9 proposal
Proyek remot
Aktif 8 tahun yang lalu

Ingin menghasilkan uang?

Keuntungan menawar di Freelancer

Tentukan anggaran dan garis waktu Anda
Dapatkan bayaran atas pekerjaan Anda
Uraikan proposal Anda
Gratis mendaftar dan menawar pekerjaan
9 freelancer menawar dengan rata-rata $545 USD untuk pekerjaan ini
Avatar Pengguna
Hello, My proposal is for windows app using .Net. I have experienced in read the doc and PDF using open source libraries in C# such as openxml . If you are interested, please let me know. I'm looking some more samples. Thanks, Sheik
$474 USD dalam 10 hari
4,9 (46 ulasan)
5,8
5,8
Avatar Pengguna
Good at python/pdf/doc processing, and your project looks OK for me at first glance. Please contact me to discuss more detailed requirement, Thanks
$500 USD dalam 7 hari
4,9 (32 ulasan)
5,4
5,4
Avatar Pengguna
Hi Boss, Issue: extract resume I will analyst good algorithm for extract resume. In my experience, some algorithm is good for analyst data if the algorithm have threshold value and we can define that value. Because one algorithm good for some pattern but not good for other pattern. We can't limit the pattern in this world, that's why we need threshold. scope: input: 1. *doc & *pdf 2. document should be readable and content is text, not image out of scope: 1. no OCR process Thanks, catbig
$745 USD dalam 15 hari
5,0 (10 ulasan)
4,1
4,1
Avatar Pengguna
I have recently started to work as a freelancer. However, I do believe I will be one of the most appropriate candidate for this project. Being a lead developer in a medium to large commercial software dev team for over several years, I can assure you that the project will be delivered on time meeting the set requirements. Having great experience with data handling/analytics, writing various ETL processes and writing multiple custom language parsers, the quality of the code that you get will be at its highest possible standard.
$555 USD dalam 5 hari
0,0 (0 ulasan)
0,0
0,0

Tentang klien

Bendera UNITED STATES
Dupo, United States
5,0
8
Memverifikasi Metode pembayaran
Anggota sejak Jan 23, 2006

Verifikasi Klien

Terima kasih! Kami telah mengirim Anda email untuk mengklaim kredit gratis Anda.
Anda sesuatu yang salah saat mengirimkan Anda email. Silakan coba lagi.
Pengguna Terdaftar Total Pekerjaan Terpasang
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Memuat pratinjau
Izin diberikan untuk Geolokasi.
Sesi login Anda telah kedaluwarsa dan Anda sudah keluar. Silakan login kembali.