Data Engineer • Co-op • RBC Wealth Management (Data Analytics)
Designed and developed ETL pipelines using Python, Spark, and
SQL, extracting data from various sources such as Postgres DB,
transforming it according to business requirements, and loading it into
S3 buckets for further analysis and reporting.
- Implemented data analytics and machine learning models using Python libraries such as pandas, scikit-learn, and Spark ML, leveraging advanced statistical techniques to uncover patterns and trends in complex datasets.
- Successfully collaborated with cross-functional teams, including data scientists and analysts, to understand business requirements, propose innovative solutions, and deliver data-driven applications and web-based interfaces using technologies like NIFI, Java, C++, and web development frameworks.
- Demonstrated proficiency in containerization using Docker and orchestration with Kubernetes, enabling seamless deployment and management of data engineering applications and services.
- Implemented and maintained CI/CD pipelines using tools like Jenkins, Git, and GitHub, ensuring automated build, test, and deployment processes for efficient software delivery.
- Orchestrated containerized applications and services using Docker and Kubernetes, optimizing scalability, resilience, and resource utilization while maintaining high availability.
- Leveraged cloud platforms such as AWS to design and manage scalable and secure infrastructure, including utilizing services like EC2, S3, and EKS for efficient resource provisioning and management.
- Automated infrastructure provisioning and configuration management using tools like Ansible, enabling efficient deployment and management of software and infrastructure components.
- Collaborated with development teams to streamline software releases, ensuring smooth integration, testing, and deployment processes while maintaining version control and ensuring code quality.
January 2020 – December 2022
Customer Service Representative• Polaris Transportation Group