Data Science, Emerging Talent Intern
Position at MTA Headquarters
Departments: Strategic Initiatives, Data and Analytics Team
Location: 2 Broadway, New York, NY
Position Title: Data Science, Emerging Talent Intern
Hourly Rate: $19.00 (Undergraduate), $21.00 (Graduate), $22.00 (Post-Graduate)
Overview of Department:
The MTA Data and Analytics Team builds modern data infrastructure, owns the product experience of analytics within the MTA, manages our institution’s Open Data Program, and tackles the agency’s biggest data-driven challenges. The Data Science Intern will help the MTA Data and Analytics Team assemble, transform, and manage large data sources to support better decision-making in service delivery, infrastructure maintenance, financial and administrative management, and all other business activities at the MTA. They will work with data and process owners to build analytic products using programming expertise in Python, SQL, or R. They will also work with the team’s Data Engineers, who design and build pipelines, and the Reporting group, which delivers reporting products to clients.
Responsibilities:
- Write code to clean, combine, and transform data generated from business operations
- Design and document algorithmic processes to carry out data transformations
- Incorporate quality checks in data processes to ensure sustainable accuracy
- Build Apache Airflow pipelines to deploy data processes as long-term, maintainable data pipelines
- Update existing data processes to manage input changes, address new business priorities, and improve maintainability
- Monitor the health of existing data pipelines, and help fix them if breakdowns occur
- Assist in data analyses to answer questions from MTA leadership
- Assist in the creation of presentation documents (e.g., PowerPoints) to share data findings and products with MTA leadership and other MTA groups
Projects:
- Migration of legacy data pipelines related to subway and bus performance to Apache Airflow and Python
- Assist in creation of new pipelines for calculating subway and bus ridership patterns, and subway and bus performance
- Assist in creation of new pipelines for calculating key MTA-wide datasets on material procurement
Required Qualifications:
- Strong data analytics skills with knowledge of probability, statistics, and algorithms
- Proficient in Python or SQL, and in Excel, Power BI, Tableau, or Mode Analytics
- Skilled in coding, documentation, and data quality checks
- Proficiency in Microsoft Office Suite is a must.
- The candidate should possess organizational, analytical and communication skills.
- The candidate should be able to work well under pressure and prioritize tasks effectively.
- The candidate should have a keen eye for detail and be able to work independently while being an active team player.
- Knowledge of data engineering; Apache Airflow experience is a plus
- Familiarity with transit systems and MTA is helpful but not required
Required Education:
- Matriculated in an undergraduate program in good standing with at least a 2.5 GPA, or in a graduate program with at least a 2.8 GPA.
- Major(s): Computer Science, Data Science, Transportation, Urban Science and Informatics, Economics, Statistics, or related field.
All applicants must be authorized to work in the United States at the time of application. Student's transcript must be submitted.
Equal Employment Opportunity
MTA and its subsidiary and affiliated agencies are Equal Opportunity Employers, including with respect to veteran status and individuals with disabilities. The MTA encourages qualified applicants from diverse backgrounds, experiences, and abilities, including military service members, to apply.