Over 2 years of experience as a Data Engineer across various domains, building and deploying data-intensive projects for data acquisition, data visualization, and data mining on large structured and semi-structured datasets.
Experience
Experienced in using Python for data loading, extraction, and manipulation, and in data analysis with libraries such as Matplotlib, NumPy, Seaborn, statsmodels, and pandas.
Solid ability to write and optimize diverse SQL queries, with working knowledge of RDBMSs such as Teradata, MySQL, and SQL Server.
Proficient in SQL databases such as MySQL, MS SQL Server, and PostgreSQL. Worked with UNIX/Linux, including command-line tools and shell scripting.
Proficient in the high-level design of ETL mappings that integrate data from multiple heterogeneous sources (Excel, flat files, text-format data) using Informatica PowerCenter.
Experience in data engineering on AWS (S3, EC2, RDS, EMR, Redshift).
Installed, configured, and maintained RDBMS software, ensuring it ran smoothly and efficiently.