Skip to main content

Big Data PySpark Developer

Technology
LTIMindtree
Irving, United States1 weeks agoUntil 6/8/2026
Full timeOn-site

Job description

Requirements

Must have:

- Minimum of 3 years of experience in Hadoop programming with HDFS, utilizing PySpark and Hive-based data warehouse projects.
  • Proficiency in Big Data technologies such as PySpark, Hive, and Hadoop.
  • Strong programming skills in PySpark.
  • Solid understanding of organizational strategies, architecture patterns, Microservices, and Event Driven technology, with experience in coaching teams for execution in accordance with these frameworks.
  • Capability to implement organizational technology patterns in projects and offer alternative solutions.
  • Hands-on experience managing large data volumes with diverse ingestion and processing patterns, including batch and real-time, along with independent decision-making within project scopes.
  • Strong grasp of data structures and algorithms.
  • Ability to test, debug, and resolve issues within agreed Service Level Agreements (SLAs).
  • Skills in designing software that is easily testable and observable.
  • Understanding of how team objectives align with business needs.
  • Capability to identify project-level business challenges and suggest solutions.
  • Knowledgeable about data access methods, streaming technologies, data validation, data performance, and cost optimization.
  • Excellent SQL skills.

Responsibilities:

- Develop and maintain Hadoop-based solutions using PySpark and Hive.
  • Design and implement robust data pipelines for efficient data processing and ingestion.
  • Engage in collaborative discussions with the team to align technical strategies with organizational goals.
  • Provide guidance and mentorship to junior developers on best practices and project execution.
  • Diagnose and resolve technical challenges to ensure project deadlines are met.
  • Create scalable and maintainable code, following design principles and coding standards.
  • Conduct performance tuning and optimization of data processes.
  • Regularly communicate project updates and challenges to stakeholders.
  • Stay current with emerging technologies and integrate relevant advancements into projects.

Company:

At LTIMindtree, we pride ourselves on being a leading global technology consulting and digital solutions provider. Our mission is to empower enterprises across various sectors by innovating business models and accelerating their growth through cutting-edge digital technologies. With a robust workforce of nearly 90,000 skilled professionals operating in over 30 countries, we are committed to delivering exceptional business outcomes and client satisfaction. Located in Irving, Texas, we offer a comprehensive benefits package, including medical, dental, and vision coverage, disability insurance, a 401(k) plan with company matching, life insurance, and generous paid leave policies. We foster a diverse and inclusive work environment where all employees can thrive.

Keywords
CodingApache HadoopSqlHadoopHiveCoding conventionsBig dataDebugger

¿Te interesa este puesto?