Job Description
Design and implement scalable ML and deep learning models using PyTorch, TensorFlow, Scikit-learn, and other modern frameworks. Build and optimize RAG pipelines using models like GPT, Claude, or other LLMs integrated with document retrieval systems. Develop production-ready ML applications in cloud environments (AWS, SageMaker, Databricks, etc.). Leverage GPU-based computing resources and CUDA/Nvidia for performance optimization in training and inference. Collaborate with data engineers, software developers, and product teams to deliver AI/ML capabilities in production. Conduct rigorous model evaluations, A/B testing, error analysis, and performance tuning. Implement responsible AI principles and ensure model fairness, interpretability, and compliance. Explore and integrate open-source LLM models and fine-tuning strategies for specialized domain applications. Drive innovation in prompt engineering and generation pipelines for AI-assisted systems.
About GalaxE
GalaxE, now Endava, is a professional IT services firm that specializes in platform-driven solutions and the use of automation.