Software Engineer (Python, SQL)
Technology
Confidential
1 weeks agoUntil 6/5/2026
Full timeFully remote
Job description
- *About The Company**
- *About The Role**
- *Qualifications
- Bachelor's degree in Engineering, Computer Science, IT, or a related field
- 12+ years of total IT experience
- 8+ years of hands-on software development, data engineering, or analytics with a focus on AI/ML delivery (Azure preferred), using Scala, Python, or PySpark
- 4+ years of experience working with Databricks
- 4+ years of experience with ADF/Airflow for orchestration and scaling
- 4+ years of experience with big data technologies and streaming platforms such as Hadoop, MapReduce/HDFS, Spark, Kafka; Docker/Kubernetes
- 4+ years of experience with MySQL and NoSQL databases
- 4+ years of experience working within Agile/Scrum methodologies, GitHub, Jenkins CI/CD, and JUnit, with a strong focus on coding standards and code reviews
- 2+ years of experience with LLMs and Generative AI tools, including Langchain, LangGraph, RAG, Vector DB, Azure Open AI, MCP Server, and LangFuse
- 2+ years of experience with container technologies like Docker and Kubernetes
- 1+ years of experience building full-stack or service-oriented applications using frameworks such as FastAPI/Flask, Node.js, React/Angular, TypeScript, HTML/CSS
- *Responsibilities
- Design and implement multi-agent workflows where large language models (LLMs) plan, decompose tasks, invoke tools and APIs, and synthesize answers from diverse data sources and services
- Develop retrieval augmented generation (RAG) and hybrid search pipelines to facilitate robust question answering over clinical and operational data
- Code, test, document, and maintain high-quality, scalable Big Data and cloud-based solutions
- Create scalable microservices and APIs to integrate agent capabilities into clinician tools and internal applications
- Develop prototypes and proof-of-concept solutions, conduct design and code reviews to ensure delivery quality and mitigate risks
- Leverage and adapt LLMs through prompt engineering, grounding, domain adaptation, and guardrails to meet healthcare-specific requirements
- Establish evaluation frameworks to measure model faithfulness, helpfulness, bias, toxicity, privacy leakage, and overall quality, incorporating automatic and human-in-the-loop assessments
- Collaborate with data engineering teams to build feature stores, retrieval pipelines, embeddings, and ETL/ELT processes on platforms like Spark and Databricks
- Define and develop APIs for enterprise-wide data integration, optimizing data access for low latency inference
- Own MLOps/LLMOps processes, including CI/CD pipelines, automated testing, model versioning, lineage, and deployment strategies such as blue/green or canary releases
- Instrument Service Level Objectives (SLOs), Service Level Indicators (SLIs), and cost KPIs with dashboards and alerts to monitor system performance and efficiency
- Lead production deployments on internal platforms, ensuring observability, reliability, and cost control measures are in place
- Champion security, privacy, and compliance standards aligned with HIPAA and other regulated industry controls, including access controls, encryption, and auditability
- Collaborate with legal, compliance, and clinical safety teams to operationalize responsible AI principles
- Analyze customer requirements, define technical architecture, and contribute to product delivery roadmaps
- Provide effort estimates, resource planning, and technical documentation; mentor engineers and data scientists; stay current with industry trends and advancements
- *Benefits
- Comprehensive health benefits package
- Incentive and recognition programs
- Equity stock purchase options
- 401(k) retirement plan contributions
- Flexible remote work environment
- Opportunities for professional development and career growth
- Supportive and inclusive workplace culture
- *Equal opportunity
Keywords
pythonsqlcomputer-scienceinformation-technologysoftware-developmenttraining-and-developmentdata-engineeringanalyticsdata-analyticsartificial-intelligencemachine-learningmicrosoft-azurescalapysparkazure-databricksdatabricksairflowapache-airflowservice-management-and-orchestration-smobig-datahadoopapache-hadoopmapreducesparkkafkadockerkubernetesmysqlnosqlgithubjenkinscustomer-intelligence-cicontinuous-integrationcd-certificate-of-depositci-cdjunitprogramming-style-guidecode-reviewgenerative-artificial-intelligence-generative-ailangchainlanggraphretrieval-augmented-generation-ragdesign-build-d-bdefined-benefit-plansmicrosoft-certificationmodel-context-protocol-mcpfastapiflasknodejsreact-jsreacttypescriptmicrosoft-typescriptcascading-style-sheets-cssplanning-and-designvisual-art-designproduct-development-and-designlarge-language-model-llmsearch-and-retrievalsensors-test-measurementmicroservicesproof-of-conceptproof-of-concept-pocprompt-engineeringguardrailsracking-protectionsafety-barriershealth-careassessment-assessment-toolshuman-in-the-loop-hitlextract-transform-and-load-etlelectronic-titleextract-load-transform-eltdata-integrationdata-accessnetwork-latencymachine-learning-ops-mlopslarge-language-model-ops-llmopstesting-and-analysisautomation-testingmodel-version-controlobjectives-and-key-resultsperformance-indicatorobservabilitycost-controlcost-managementcompliancehealth-information-privacy-hipaahipaa-compliancedata-encryptionenvironment-health-and-safety-hsseresponsible-aiyouth-organizations-resourcesplanning-and-forecastingelectrical-engineering-and-planningmentoringpensions-retirement-benefitsretirement-planningremote-workingecology-environmentprofessional-development
¿Te interesa este puesto?