Remote, but you must be in the following locations
Provectus is a leading AI consultancy and solutions provider specializing in Data Engineering and Machine Learning. With a focus on helping businesses unlock the power of their data, we leverage the latest technologies to build innovative data platforms that drive results. Our Data Engineering team consists of top-tier professionals who design, implement, and optimize scalable, data-driven architectures for clients across various industries.
We are seeking a talented and experienced Data Engineer to join our team at Provectus. Working across our diverse practices, including Data, Machine Learning, DevOps, Application Development, and QA, you will collaborate with a multidisciplinary team of data engineers, machine learning engineers, and application developers.
* Collaborate closely with clients to deeply understand their existing IT environments, applications, business requirements, and digital transformation goals.
* Collect and manage large volumes of varied data sets.
* Work directly with ML Engineers to create robust and resilient data pipelines that feed Data Products.
* Define data models that integrate disparate data across the organization.
* Design, implement, and maintain ETL/ELT data pipelines.
* Perform data transformations using tools such as Spark, Trino, and AWS Athena to handle large volumes of data efficiently.
* Develop, continuously test, and deploy Data API Products with Python and frameworks like Flask or FastAPI.
* Experience handling real-time and batch data flows and data warehousing with tools and technologies such as Airflow, Dagster, Kafka, Apache Druid, Spark, and dbt.
* Experience with AWS.
* Proficiency in programming languages relevant to data engineering, such as Python and SQL.
* Proficiency with Infrastructure as Code (IaC) technologies like Terraform or AWS CloudFormation.
* Experience in building scalable APIs.
* Familiarity with Data Governance aspects like Quality, Discovery, Lineage, Security, Business Glossary, Modeling, Master Data, and Cost Optimization.
* Upper-Intermediate or higher English skills.
* Ability to take ownership, solve problems proactively, and collaborate effectively in dynamic settings.
* Experience with Cloud Data Platforms (e.g., Snowflake, Databricks).
* Experience in building Generative AI Applications (e.g., chatbots, RAG systems).
* Relevant AWS, GCP, Azure, or Databricks certifications.
* Knowledge of BI Tools (Power BI, QuickSight, Looker, Tableau, etc.).
* Experience in building Data Solutions in a Data Mesh architecture.
* Participate in internal training programs (Leadership, Public Speaking, etc.) with full support for AWS and other professional certifications.
* Access to the latest AI tools and premium subscriptions, with the freedom to use them in your daily work.
* Long-term B2B collaboration.
* 100% remote, with flexible hours.
* Collaboration with an international, cross-functional team.
* Comprehensive private medical insurance or budget for your medical needs.
* Paid sick leave, vacation, and public holidays.
* Equipment and all the tech you need for comfortable, productive work.
* Special gifts for weddings, childbirth, and other personal milestones.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are made by humans. If you would like more information about how your data is processed, please contact us.