Data Engineer
Remote Poland, PolskaОсновні характеристики вакансії
Мін. 5 років досвіду
Дані: SQL / BI / Python
DevOps / Хмара: AWS, Azure, Docker, Kubernetes
Повний робочий день
Віддалена робота - без поїздок
Description
Join the Data Engineering team to contribute to the ongoing maintenance and improvement of an internal assistant that uses hosted APIs and internal knowledge sources, with a focus on reliability, retrieval quality, and operational excellence.
What we offer
Global Relocation - (Relocation options; Experience in an international environment; Cross-cultural experience)
Recognition and Evaluation - (Feedback culture; Regular appraisals)
Time Off - (Annual holiday - 20 or 26 days. The duration of the leave depends on the overall seniority; Occasional leave - 1 or 2 days/ depending on the circumstances; Child care leave - 2 days or 16 hours per year; Absence due to force majeure - 2 days or 16 hours per year; Maternity Leave - 20 weeks; Parental Leave - 41 weeks; Paternity Leave - 14 days)
Luxoft Training Center - (Expert-led tech courses covering basic to advanced topics; Internal instructor-led soft skills courses; Comprehensive in-house self-learning resources for both soft and hard skills; Access to external self-learning libraries like ProQuest eBook and Udemy for Business; Cloud Programs: MS Cloud Academy, AWS Partner Academy, Google Cloud Academy; Custom Learning Programs: upskilling, reskilling, technical mentorship; Leadership Programs for Managers)
Well-being and Work-life Balance - (Multisport card; Possibility to order Multisport card at the corporate rate for family members; LuxGood Program: wellbeing seminars, contests, relaxation sessions, yoga sessions, etc.; One Team Program: Buddy for each New Joiner; seminars, meeting and workplace space to support integration with local community and culture; “Hire me” workshops for partners; Preferential banking offer; Preferential car leasing offer; Cafeteria program discounts for shops, cinema tickets, holiday offers; Luxoft Social Benefit Fund: sport and recreation benefits, the possibility to receive financial support)
Health Care - (Private Healthcare Insurance with unlimited access to specialists; Full dental support; Travel Insurance; Possibility to add private healthcare coverage for family members at the corporate rate; Life insurance at the corporate rate for employees and family members, including payment of the basic package for the employee by the employer; Reimbursement for corrective glasses)
Company Events and Friendly Environment - (Many fun social activities organized by the Luxoft team offline in your city; Online entertainment events for whole company and local team events; A workplace where you’re treated with respect within a multicultural team)
Internal Mobility - (Rotation between projects and accounts; New career opportunities)
Self-Learning Library
CSR Projects
Other
Languages: English: C1 Advanced
Seniority: Senior
Requirements
8+ years of hands-on experience in Data Science and 2+ years in Machine Learning, with a proven track record, demonstrated through a robust portfolio of projects.
Strong programming skills in languages such as Python and familiarity building ETL pipelines.
Expertise in SQL and experience with both relational (preferably Postgres) and NoSQL databases
Solid experience with OpenSearch
Familiarity with AWS cloud platform and its services.
Experience with version control systems (e.g., Git) and CI/CD pipelines.
Ability to build scalable infrastructure to embed and search very large number of documents.
Ability to move fast in an environment where things are sometimes loosely defined and may have competing priorities or deadlines.
Expertise in ML
Strong English skills (B2 and higher)
Strong verbal and written communication skills.
Ability to work independently and collaborate in a group.
Agile certification
Oracle/Microsoft attestations and certifications
Domain knowledge
Trading and Capital Markets
Responsibilities
Maintain and enhance ingestion/enrichment pipelines for internal content (parsing/extraction, normalization, metadata enrichment, deduplication, and quality monitoring)
Implement and maintain access-aware retrieval by propagating/enforcing document permissions through indexing and query-time filters, including audit logs and validation tests
Improve source attribution so responses reliably point to the correct documents and sections in a consistent format.
Extend and harden tool/workflow execution and automations (scheduled/trigger-based), including retries, timeouts, idempotency, concurrency controls, and run history
Operate the platform in production: observability (logs/metrics/tracing), alerting, incident support, performance tuning, and cost controls, plus runbooks and handover documentation
Ключові слова / Навички