Find Jobs
Hire Freelancers

data project ( All bid will be checked) Kindly check the details

$10-30 AUD

Imefungwa
Imechapishwa about 2 years ago

$10-30 AUD

Kulipwa wakati wa kufikishwa
In this project, you will develop an Oozie workflow to process and analyze a large volume of flight data. • Instructions: 1. Form a project team of four students (including yourself). 2. Install Hadoop/Oozie on your AWS VMs. 3. Download the Airline On-time Performance data set (flight data set) from the period of October 1987 to April 2008 on the following website: [login to view URL]:10.7910/DVN/HG7NV7 4. Design, implement and run an Oozie workflow to find out a. the 3 airlines with the highest and lowest probability, respectively, of being on schedule; b. the 3 airports with the longest and shortest average taxi time per flight (both in and out), respectively; and c. the most common reason for flight cancellations. • Requirements: 1. Your workflow must contain at least three MapReduce jobs that run in fully distributed mode. 2. Run your workflow to analyze the entire data set (total 22 years from 1987 to 2008) at one time on two VMs first and then gradually increase the system scale to the maximum allowed number of VMs for at least 5 increment steps, and measure each corresponding workflow execution time. 3. Run your workflow to analyze the data in a progressive manner with an increment of 1 year, i.e. the first year (1987), the first 2 years (1987-1988), the first 3 years (1987-1989), …, and the total of 22 years (1987-2008), on the maximum allowed number of VMs, and measure each corresponding workflow execution time. • Submission (all in a zipped file: [login to view URL]): 1. A [login to view URL] text file that lists all the commands you used to run your code and produce the required results in a fully distributed mode 2. An [login to view URL] text file that stores the final results from all the runs 3. The source code of your MapReduce programs (including the JAR files) and any other programs you might have developed and included in the workflow 4. The Oozie workflow XML file 5. A project report in PDF that includes: a. A diagram that shows the structure of your Oozie workflow b. A detailed description of the algorithm you designed to solve each of the problems c. A performance measurement plot that compares the workflow execution time in response to an increasing number of VMs used for processing the entire data set (22 years) and an in-depth discussion on the observed performance comparison results d. A performance measurement plot that compares the workflow execution time in response to an increasing data size (from 1 year to 22 years) and an in-depth discussion on the observed performance comparison results
Kitambulisho cha mradi: 33638922

Kuhusu mradi

4 mapendekezo
Mradi wa mbali
Inatumika 2 yrs ago

Unatafuta kupata pesa?

Faida za kutoa zabuni kwenye Freelancer

Weka bajeti yako na muda uliopangwa
Pata malipo kwa kazi yako
Eleza pendekezo lako
Ni bure kujiandikisha na kutoa zabuni kwa kazi
4 wafanyakazi huru wana zabuni kwa wastani $18 AUD kwa kazi hii
Picha ya Mtumiaji
I am very skilled in data entry and Excel works. I am also very qualified in data extracting and tele communicating. If you hire me, you will get many services at one time investment. I am very much confident of our succession together as I am punctual and creative.I look forward to hear from you soon. Thank you.
$10 AUD ndani ya siku 7
0.0 (0 hakiki)
0.0
0.0
Picha ya Mtumiaji
Hi, I have three year work experience of MS Office. Advance excel. I will do your job in cheap price. Regards, Rajinder Singh
$20 AUD ndani ya siku 7
0.0 (0 hakiki)
0.0
0.0
Picha ya Mtumiaji
I am offering my services on short notice. Relevant skills and experience, please consider me and give me a chance to impress you by my quality services.
$20 AUD ndani ya siku 7
0.0 (0 hakiki)
0.0
0.0

Kuhusu mteja

Bedera ya EGYPT
Cairo, Egypt
4.9
38
Njia ya malipo imethibitishwa
Mwanachama tangu Okt 25, 2018

Uthibitishaji wa Mteja

Asante! Tumekutumia kiungo cha kudai mkopo wako bila malipo kwa barua pepe.
Hitilafu fulani imetokea wakati wa kutuma barua pepe yako. Tafadhali jaribu tena.
Watumiaji Waliosajiliwa Jumla ya Kazi Zilizochapishwa
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Onyesho la kukagua linapakia
Ruhusa imetolewa kwa Uwekaji wa Kijiografia.
Muda wako wa kuingia umeisha na umetoka nje. Tafadhali ingia tena.