Software Developer - CPU Performance Optimization
Technology
SOFTPATH TECH SOLUTIONS PVT LTDBangalore, India3 weeks agoUntil 31/5/2026
Full time
Job description
Job Summary :
The role involves analyzing, identifying, and resolving performance bottlenecks across multiple software layers including applications, libraries, kernel, and hardware.
The ideal candidate should have strong expertise in C/C , system-level programming, multicore systems, and performance tuning.
This team focuses on performance engineering and optimization, not support or maintenance work.
Key Responsibilities :
- Design, develop, and optimize the CPU performance stack, including applications and libraries running on AMD, ARM, or x86 architectures.
- Perform deep performance analysis and optimize software running on multi-core and multi-node environments.
- Identify and resolve performance bottlenecks related to CPU utilization, memory access, cache efficiency, and thread scaling.
- Work across multiple software layers including applications, system libraries, kernel space, and hardware interfaces.
- Use AI-assisted tools and modern research techniques to implement future-ready performance solutions.
- Collaborate with cross-functional engineering teams to improve system-level performance and scalability.
- Conduct code profiling, debugging, and optimization using industry-standard profiling and performance analysis tools.
- Contribute to performance improvements in high-performance computing (HPC), data center workloads, codec, and DSP applications.
- Continuously learn and adapt to evolving CPU architectures such as x86 and ARM and optimize workloads accordingly.
- Lead technical discussions and contribute to architecture-level optimization strategies.
Required Skills & Qualifications :
- 5 to 7 years of experience in software development with strong system-level programming skills.
- Strong programming expertise in C and C .
- Experience working on multicore CPU architectures and performance optimization.
- Hands-on experience with Linux system programming and debugging.
- Strong knowledge of data structures, algorithms, and system design concepts.
- Experience analyzing performance issues in large-scale or distributed systems.
- Understanding of CPU architecture fundamentals, including :
- Cache subsystems
- Instruction set architecture (ISA)
- CPU pipelines
- Memory hierarchy
- Experience with performance profiling and debugging tools on Linux / Windows / Mac.
Preferred/Nice-to-Have Skills :
- Experience in High Performance Computing (HPC) environments.
- Familiarity with MPI (Message Passing Interface) for parallel computing.
- Experience optimizing workloads for x86 or ARM-based processors.
- Knowledge of GPU performance optimization (optional but beneficial).
- Experience working with DSP, Codec, or Data Center workloads.
- Exposure to multi-node distributed computing environments.
Keywords
C++LinuxSystem ProgrammingMultithreadingCCompiler DesignARMx86ScigressGNU parallelScalabilityMacintoshNode.jsDeep learningInterSystems CachéNodeMessage Passing InterfaceDebugger
¿Te interesa este puesto?