Data Scientist

  • Caterpillar
  • Two Prudential Plaza, 180 N Stetson Ave #2400, Chicago, IL 60601, USA
  • Nov 09, 2020
[Information Technology]

Job Description

**Data Scientist** **Description** Cat Digital is the digital and technology arm of Caterpillar Inc., responsible for bringing world class digital capabilities to our products and services. With almost one million connected assets worldwide, we're focused on using IoT and other data, technology, advanced analytics and AI capabilities to help our customers build a better world. Cat Digital's Advanced Data Quality team is looking for a talented and motived Data Scientist to help improve platform data quality by developing and delivering ML/AI models to address the most challenging data quality issues. As a Data Scientist, you will apply machine learning and other analytics techniques on a very large set of diverse data from IoT connected assets and our integrated network of dealers. **JOB DUTIES** : As a Data Scientist, you will contribute to the design, development, deployment, and quality of Caterpillar's state-of-the-art digital platform through development of advanced Data Quality methods and routines. * Designs, codes, tests, and debugs programs of varying degrees of complexity * Evaluates recommended software and/or program changes and their potential impact on the environment and execution results * Works on application/technical problem identification and resolution, including off-shift and weekend support functions * Works independently on complex programs/subroutines * Under the direction of more experienced staff, assists in the development of major system modules and programs * Fully qualified to perform most programming assignments without close supervision * Fully knowledgeable of programming languages, program design and specification development, programming logic, logic diagrams, testing and debugging * May perform integration tasks for in-house developed systems and/or purchased software solutions * Improves development and support processes * Employee is also responsible for performing other job duties as assigned by Caterpillar management from time to time **Qualifications** **BASIC QUALIFICATIONS:** + MS degree or higher in quantitative discipline such as applied statistics, data science, data analytics, computer science, computer engineering, engineering or other related degree + Minimum cumulative GPA requirement 3.0/ 4.0 (no rounding) + Graduation date between May 2020 through May 2021 + Proficient in Python and SQL **TOP CANDIDATES WILL ALSO HAVE:** + Completed coursework in machine learning and/or computational methods in statistics and data mining + Completed coursework or projects in natural language processing + Completed research or class projects in machine learning (classification, regression, unsupervised learning) + Knowledge of relational data bases; the knowledge of NoSQL data bases is a definite plus + Familiarity with AWS (SageMaker, Athena, S3, RDS, DynamoDB, Lambda, EC2) + Experience with Snowflake + Knowledge of visualization tools like Tableau, MS Power BI, Kibana, etc. + Good analytical and problem-solving skills and be detail oriented + Effective time management skills + Strong verbal and written communication skills + Ability to work independently or as a collaborative team member + Ability to learn and comply with company policies and procedures + Ability to clearly communicate technical ideas, regardless of the technical capacity of the audience + Passion for technology and an eagerness to contribute to a team-orientated environment + Passion for working in a dynamic environment where digital is still evolving as a core offering EEO/AA Employer. All qualified individuals - including minorities, females, veterans and individuals with disabilities - are encouraged to apply. **Job** Digital **Primary Location** United States-Illinois-Chicago **Unposting Date:** Nov 6, 2020, 11:59:00 PM**Req ID:** 2000069J
Associated topics: data analytic, data architect, data center, data integrity, data scientist, data warehousing, erp, mongo database, sybase, teradata