Technical Lead AI/ML Infrastructure (Python/C++/Linux)
Job ID: ID lead_C/C++
Full time
Bucharest
Senior
04-05-2026
The project:
Our goal is to benchmark AI/ML cluster fabrics with realistic workloads. Such benchmarking typically requires investment in compute systems with GPUs and RDMA NICs, which are costly and time-consuming to build and operate. To achieve this goal, we are creating a product that emulates realistic collective communication and can test AI/ML fabrics for both conformance and performance.
The Team:
Was founded to fill a gap in the market for testing AI/ML data center workloads.
Currently has 15 members and is part of a bigger team spread across multiple geos.
We live in the virtual environment, working in Agile, meeting daily to plan and track our objectives.
We are working directly with our partners as well as the end customer in a dynamic fashion.
We are always designing and developing new features, resolving customer enquiries, and working closely with other teams.
Employer Profile
A leading global provider of hardware and software for electronic design, emulation, and testing
Responsibilities
In this role you will become proficient in:
Core Skills:
Advanced Python 3 development skills in async environments, with full type annotations
Strong API design and modeling, focused on user experience and product requirements
Expertise in designing backward- and forward-compatible APIs and accurate domain modeling
Hands-on experience with OpenAPI 3.x (YAML/JSON) and RESTful services (e.g., Connexion)
Deep knowledge of async Python stacks (asyncio, pytest-asyncio, SQLAlchemy async)
Solid background in Linux user-space development, networking, performance optimization, and algorithms
Complementary Skills:
Protobuf and gRPC, and an understanding of how they differ from RESTful APIs
pytest with async support
Familiarity with GitLab CI, Make, Python packaging, and Docker
Responsibilities:
Participate in the analysis, design, development and maintenance of AI/ML-related products.
Maintain and enhance current products and participate in the design and development of applications for both internal and external use.
Interact with project management, leads, testers and other developers to understand the features, plan the schedule, design and implement solutions, optimize, perform development testing and fix bugs, so that high-quality releases are delivered on time.
Requirements
Qualifications
What you need:
Experience in API design and modeling
Strong knowledge of Python programming
Good knowledge of Linux user space
Good problem-solving skills
Network programming expertise
Good algorithms/data structures knowledge
Ability to quickly learn and grasp new technologies
Desire and ability to work in a highly collaborative, team-oriented environment
Performance-driven mindset and a proactive attitude
Bonus skills (not mandatory but appreciated)
Good knowledge of virtualization technologies (QEMU/KVM)
Good knowledge of Linux container technology and Docker/Kubernetes