Data Science Jobs
Documentation
https://github.com/uthmandantata/excel_projects
Data source: kagglehttps://www.kaggle.com/datasets/nikhilbhathi/data-scientist-salary-us-glassdoor
TODO - talk about dataset(source, what it contains, how it will be useful)
Data Preperations
Exploratory Analysis
Columns we'll anlyze:Questions Asked
Answers
Code used: =COUNTIF(Python,"Required")
Code used:
=COUNTIF(google_an,"Required")
to get the number of
jobs that require google data analytics certificate & =COUNTA(job_title_sim)
to get the number of jobs
Code used:
=AVERAGEIF(seniority_by_title,"jr",Avg_Salary_K)&"K"
Process used: Pivot Table
(alt H N V)Fields used: Company Name
Process used: Pivot Table
(alt H N V)Fields used: Company Name & seniority_by_title
Code used: =IF(COUNTIFS(seniority_by_title,"sr",Python,"required")>COUNTIFS
(seniority_by_title,"sr",sql,"required"),"Python is more important","Sql is more important" )
Code used:
=IF(COUNTIFS(Table1[job_title_sim],B66,Table1[google_an],"Required")>COUNTIFS(Table1[job_title_sim],B66,Table1[mongo],"Required"),"Google",
IF(COUNTIFS(Table1[job_title_sim],B66,Table1[mongo],"Required")>COUNTIFS(Table1[job_title_sim],B66,Table1[flink],"Required"),"Mongo",
IF(COUNTIFS(Table1[job_title_sim],B66,Table1[flink],"Required")>COUNTIFS(Table1[job_title_sim],B66,Table1[bi],"Required"),"Flink",
IF(COUNTIFS(Table1[job_title_sim],B66,Table1[bi],"Required")>COUNTIFS(Table1[job_title_sim],B66,Table1[tableau],"Required"),"BI",
IF(COUNTIFS(Table1[job_title_sim],B66,Table1[tableau],"Required")>COUNTIFS(Table1[job_title_sim],B66,Table1[hadoop],"Required"),"tableau",
IF(COUNTIFS(Table1[job_title_sim],B66,Table1[hadoop],"Required")>COUNTIFS(Table1[job_title_sim],B66,Table1[tensor],"Required"),"hadoop",
IF(COUNTIFS(Table1[job_title_sim],B66,Table1[tensor],"Required")>COUNTIFS(Table1[job_title_sim],B66,Table1[scikit],"Required"),"tensor",
IF(COUNTIFS(Table1[job_title_sim],B66,Table1[scikit],"Required")>COUNTIFS(Table1[job_title_sim],B66,Table1[pytorch],"Required"),"scikit",
IF(COUNTIFS(Table1[job_title_sim],B66,Table1[pytorch],"Required")>COUNTIFS(Table1[job_title_sim],B66,Table1[keras],"Required"),"pytorch",
IF(COUNTIFS(Table1[job_title_sim],B66,Table1[keras],"Required")>COUNTIFS(Table1[job_title_sim],B66,Table1[sas],"Required"),"keras",
IF(COUNTIFS(Table1[job_title_sim],B66,Table1[sas],"Required")>COUNTIFS(Table1[job_title_sim],B66,Table1[sql],"Required"),"sras",
IF(COUNTIFS(Table1[job_title_sim],B66,Table1[sql],"Required")>COUNTIFS(Table1[job_title_sim],B66,Table1[excel],"Required"),"sql",
IF(COUNTIFS(Table1[job_title_sim],B66,Table1[excel],"Required")>COUNTIFS(Table1[job_title_sim],B66,Table1[aws],"Required"),"excel",
IF(COUNTIFS(Table1[job_title_sim],B66,Table1[aws],"Required")>COUNTIFS(Table1[job_title_sim],B66,Table1[spark],"Required"),"aws",
IF(COUNTIFS(Table1[job_title_sim],B66,Table1[spark],"Required")>COUNTIFS(Table1[job_title_sim],B66,Table1[Python],"Required"),"spark","Python")))))))))))))))