Parallel Python Code That Counts How Many Websites Have Canvas

$10-30 USD

Completed
Posted over 6 years ago

Paid on delivery
I need a simple Python script that scrapes a list of websites from a CSV file (e.g. the attached top 500,000 Alexa sites) and checks whether each website uses Canvas, either in the HTML (by checking for "<Canvas>") or in JavaScript (by checking for "createElement("canvas")" or "createElement('canvas')"). The script should output the number and percentage of websites in the list that use Canvas. It is recommended that the code use the Python libraries "Requests" and/or "BeautifulSoup4", with logic similar to what I started writing (attached).

The following points need to be satisfied:
• The code uses parallel computing for efficiency, so it doesn't run for too long.
• The HTTP header has to look like it came from a real browser, so websites don't block it.
• Reading a website should not take more than 30 seconds; if there is no response within 30 seconds, the request should time out and the script should move on to the next website.
• The script needs to count and print the number of successfully read and unread sites from the CSV file of top sites (as the script I am attaching does for the unread ones). A site may be unread because it is no longer available or responsive, or for any other reason.
• The script needs to handle errors and must not crash.
• The script has to print the duration of execution (in hours, minutes, or seconds).
• The script has to print the number and percentage of sites containing Canvas, either in the HTML source code or in JavaScript.

It would be great to also have a non-parallel version to compare performance, but that is not essential.
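The requirements above could be approached along the following lines. This is only a minimal sketch, not the client's attached script or the delivered solution: it swaps Requests for the standard library's `urllib.request` so it runs without third-party installs, and the User-Agent string, the worker count of 50, the "rank,domain" CSV layout, and all function names are assumptions made for illustration.

```python
import csv
import re
import time
import urllib.request
from concurrent.futures import ThreadPoolExecutor

# Assumed browser-like header; any current browser User-Agent string would do.
HEADERS = {"User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) "
                         "AppleWebKit/537.36 (KHTML, like Gecko) "
                         "Chrome/120.0 Safari/537.36"}
TIMEOUT = 30  # seconds; applies to each blocking socket operation, not the whole download

# Matches a <canvas> tag (any case) or createElement("canvas") / createElement('canvas').
CANVAS_RE = re.compile(r"<canvas[\s>/]|createElement\(\s*[\"']canvas[\"']\s*\)",
                       re.IGNORECASE)


def has_canvas(page_source):
    """Return True if the page source mentions Canvas in HTML or JavaScript."""
    return CANVAS_RE.search(page_source) is not None


def fetch(url):
    """Download a page as text, sending the browser-like headers."""
    request = urllib.request.Request(url, headers=HEADERS)
    with urllib.request.urlopen(request, timeout=TIMEOUT) as response:
        return response.read().decode("utf-8", errors="replace")


def check_site(domain, fetcher=fetch):
    """Return True/False for Canvas use, or None if the site could not be read."""
    try:
        return has_canvas(fetcher("http://" + domain))
    except Exception:
        # Timeouts, DNS failures, HTTP errors: count the site as unread and move on.
        return None


def read_domains(csv_path):
    """Read domains from a CSV assumed to hold 'rank,domain' rows."""
    with open(csv_path, newline="") as handle:
        return [row[1].strip() for row in csv.reader(handle) if len(row) > 1]


def summarize(results):
    """Count read/unread sites and Canvas usage as a share of the whole list."""
    results = list(results)
    read = [r for r in results if r is not None]
    with_canvas = sum(read)
    total = len(results)
    percent = round(100 * with_canvas / total, 2) if total else 0.0
    return {"total": total, "read": len(read), "unread": total - len(read),
            "with_canvas": with_canvas, "percent_with_canvas": percent}


def format_duration(seconds):
    """Render elapsed seconds as hours, minutes, and seconds."""
    hours, rest = divmod(int(seconds), 3600)
    minutes, secs = divmod(rest, 60)
    return f"{hours}h {minutes}m {secs}s"


def run(domains, workers=50, fetcher=fetch):
    """Check all domains with a thread pool; workers=1 gives a sequential baseline."""
    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = list(pool.map(lambda d: check_site(d, fetcher), domains))
    stats = summarize(results)
    stats["seconds"] = time.perf_counter() - start
    return stats
```

With a hypothetical file name, `stats = run(read_domains("top-sites.csv"))` followed by `print(stats, format_duration(stats["seconds"]))` would report the counts, the percentage, and the run time. Threads are used here because the work is dominated by waiting on the network; calling `run` again with `workers=1` gives the non-parallel comparison. With Requests installed, `requests.get(url, headers=HEADERS, timeout=TIMEOUT).text` could stand in for the body of `fetch`.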
Project ID: 15614099

About the project

1 proposal
Remote project
Active 6 yrs ago

Awarded to:
I am a Python expert and I can do your work. I can start immediately and complete your work on time.
$30 USD in 1 day
4.9 (161 reviews)
6.2

About the client

Alkhobar, Saudi Arabia
5.0
3
Payment method verified
Member since Feb 9, 2014

Client Verification

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)