Nvidia Solutions Architect
Warszawa, Mazowieckie, Polska, 01-208, Wrocław, Dolnośląskie, Polska, 50-086Wichtige Merkmale des Angebots
Hybridmodell - teilweise remote
8+ Jahre Erfahrung
Architektenrolle
Description
Location: Wrocław / Katowice / Warsaw - 2 days in office / 3 days remote Let us introduce you the job offer by EY GDS Poland – a member of the global integrated service delivery center network by EY. The opportunity Join a high-impact role shaping next-generation enterprise AI solutions powered by NVIDIA. You will lead architecture for complex agentic systems, multimodal AI, and real-time vision platforms, working with advanced tooling to transform business use cases into scalable, high-performance AI products deployed at scale. What we look for We value professionals passionate about NVIDIA’s Physical AI ecosystem—Omniverse, OpenUSD, Isaac Sim, and GPU-accelerated robotics. If you thrive on building digital twins, simulating real-world physics, and deploying AI policies to robots, this role is for you. You bring deep technical excellence with strong collaboration skills, enjoy solving complex spatial computing problems, and are excited about pushing the boundaries of simulation-driven engineering. If you’re hands-on, curious, and comfortable working across infrastructure, graphics, and robotics domains in a fast-evolving NVIDIA stack, you will fit in well.
What we offer
EY Global Delivery Services (GDS) is a dynamic and truly global delivery network. We work across nine locations – Argentina, Hungary, India, the Philippines, Poland, Sri Lanka, Mexico, Spain and the United Kingdom – and with teams from all EY service lines, geographies and sectors, playing a vital role in the delivery of the EY growth strategy. From accountants to coders to advisory consultants, we offer a wide variety of fulfilling career opportunities that span all business disciplines. In GDS, you will collaborate with EY teams on exciting projects and work with well-known brands from across the globe. We’ll introduce you to an ever-expanding ecosystem of people, learning, skills and insights that will stay with you throughout your career.
Continuous learning: You’ll develop the mindset and skills to navigate whatever comes next.
Success as defined by you: We’ll provide the tools and flexibility, so you can make a meaningful impact, your way.
Transformative leadership: We’ll give you the insights, coaching and confidence to be the leader the world needs.
Diverse and inclusive culture: You’ll be embraced for who you are and empowered to use your voice to help others find theirs.
Ideally, you’ll also have
Experience with Kafka, GStreamer, RTSP streaming architectures and DeepStream SDK for real-time video analytics.
LLMOps/MLOps experience, including model lifecycle governance and scalable deployment practices.
Knowledge of AI security, guardrails design, knowledge graphs and graph-based retrieval.
Experience with multimodal AI platforms, GPU capacity planning and performance tuning.
Experience working in regulated enterprise environments.
Customer-facing or consulting background, supported by relevant NVIDIA or cloud certifications.
Skills and attributes for success
Strong architectural thinking combined with deep hands-on expertise, ability to translate business needs into complex AI systems, excellent stakeholder communication, problem-solving in distributed GPU environments, and a pragmatic mindset focused on scalability, performance, reliability, and measurable business value.
Your key responsibilities
Design and implement end-to-end AI architectures leveraging NVIDIA NIM, NeMo, DeepStream, and TensorRT; build multi-agent and RAG systems; optimize inference performance; define GPU infrastructure; and collaborate cross-functionally to deliver robust, scalable, and production-grade AI solutions.
To qualify for the role, you must have
8+ years of experience in infrastructure and software engineering, including 2–3 years with NVIDIA technologies.
Hands-on expertise with NVIDIA NIM, NeMo Agent Core, NeMo Guardrails, NeMo Retriever, TensorRT and TensorRT-LLM.
Experience designing multi-agent architectures, agent orchestration patterns and advanced RAG solutions using vector databases.
Strong Python, C++, Linux, PyTorch and OpenCV skills, including experience with modern CV models such as YOLO, SAM or VLMs.
Experience with Kubernetes, Docker, NVIDIA GPU Operator, KServe, cloud platforms and GPU environments such as DGX or InfiniBand.
Willingness to travel occasionally.