Regístrese para acceder a todas las funciones de nuestro servicio
  • Búsqueda de ofertas de trabajo
  • Favoritos
  • Crear CV
    Nuevo
  • Sueldos
  • Alertas de empleo

Data Pipeline Engineer, Origination Decisions

Avivacredito

About the role

Our Origination Decisions team builds the systems that decide, in real time, whether to grant a loan to an applicant and under which conditions (amount, term, interest rate). The team is small (4 people) and each member owns a slice of the stack end-to-end:

  • aDeciderarchitect, who designs how decisions are composed and how we model profit;
  • aData Scientist, who trains the most accurate ML models feeding those decisions;
  • aDeploymentengineer, who ships the deciders to production and owns the quality gates around them;
  • and theData Pipeline engineerwe are hiring with this role.

On top of your specialty, you will own a subset of our loan products end-to-end: you will build their datasets, train their models, configure their decider and follow them to production. This "dogfooding" keeps you close to the pain points your pipeline creates and is the main feedback loop that drives the roadmap of your area.

What you will own

As the Data Pipeline engineer you are the main guarantor that the rest of the team always hasfresh, trustworthy, and easy-to-use datasetsto train models, analyze behavior and make decisions on. Concretely, you will:

Build and maintain the data pipeline

  • Own the team's Dagsterpipeline.
  • Keep assets fresh, observable, and cheap to recompute. Make it obvious to other team members which datasets exist, what they contain, and how to consume them.

Bring in new data sources

  • Partner with external data providers on proofs-of-concept: organize backpopulation runs, send them the required samples, store and version the returned data, and evaluate whether the signal is worth productizing.
  • Similarly, explore the data available internally that is under-used for purpose of origination decisions. For instance, transcript of collections calls could become features (for loan renovations), or groundtruth (to get a better picture of the customer than just the payments they made).
  • When a source is promising, integrate it end-to-end: reconcile backpopulation dumps with the live API feed, extract features consistently from both, and expose them to downstream consumers.

Design feature computations, not just move data around

Some of the pipeline work is pure data engineering (joins, aggregations, cleaning), but a lot of it is closer to applied math and ML:

  • DesigningLong Term Value formulasthat chain per-loan profit estimates with time-discounting and population averages for unseen future states (e.g. "average profit of a 3rd loan for customers similar to this one"), so the team can compare counterfactual policies such as "what would the LTV be if we only used base decider X?".
  • Buildingoffline feature storesfor known customers and serving them through a low-latency store so the online decider can use information that doesn't fit in the request payload.
  • Runningreject inferenceas a recurring process: periodically sampling past rejected applications, pulling fresh credit reports, turning them into pseudo-groundtruths, and merging them into training datasets.
  • UsingLLMs and other models inside the pipelinewhen it is the right tool (e.g. extracting features from video-call transcripts, nowcasting loan profits and defaults).
  • Implementingfeature pre-selectioninside the pipeline (ranking by predictive power, de-correlating, keeping the top ~N) so that the datasets we ship are an order of magnitude smaller than today without losing signal.

Own data quality

  • Add tests to most Dagster assets and make a deliberate choice for each one: does a failure block downstream assets, trigger an alert, or simply get logged?
  • Guarantee that refactors and migrations do not silently change the value of existing features.
  • When something breaks, investigate quickly, fix at the root, and leave behind a new test closer to the source of the problem so the class of bug cannot come back unnoticed.

Be a user of your own platform

You will also be responsible for a subset of our products: building their datasets, training ML models on them, configuring a decider and following it into production. Feedback you get as a user directly feeds the priorities of your pipeline work, and gives you concrete grounds to coordinate with the data scientist, decider and deployment engineers on cross-team improvements.

What success looks like after 12 months

  • The team trusts the datasets by default: if a model behaves oddly, the first hypothesis is no longer "maybe the pipeline is wrong".
  • At least one new external data source has been integrated end-to-end, from POC to being used in a production decider.
  • The most important internal data sources are transformed into features available to the data scientist.
  • Training datasets are noticeably smaller and faster to load, thanks to in-pipeline feature pre-selection, without measurable loss in model performance.
  • For the products you own, you have shipped at least one improved decider to production, and the lessons from that experience have shaped concrete improvements in the shared pipeline.

Benefits

  • Attractive compensation package, including stock options.
  • Fast-paced environment with significant growth opportunities.
  • 15 annual vacation days + 7 annual personal days.
  • Option to work remotely 3-4 days per week ; or fully-remote (as long as you can come to CDMX ~twice a year)
  • Flexible work schedule
#J-18808-Ljbffr

Vacante publicada el 1 día atrás
Empleos similares que podrían interesarleBasado en la vacante Data Pipeline Engineer, Origination Decisions en Ciudad de México
  •  ...Airswift is searching for a Senior Gas Export Pipeline Engineer to work with a leading company in the Oil & Gas sector on a major offshore pipeline development. This role is a technical leadership position focused on offshore gas export pipeline systems and subsea infrastructure... 
    Sugerido
    Contratista
    Contrato
    Offshore

    Airswift

    Ciudad de México
    2 días atrás
  •  ...results and raise the bar using technology and data. You have a consistent track record of...  ...that impact your work and aligning decisions with Kavak's vision and purpose. # Play...  ...siguiente nivel. Buscamos un Senior AI Data Engineer para el equipo de Data Platform; un perfil... 
    Sugerido
    Autónomo

    Kavak

    Ciudad de México
    4 días atrás
  • ROSEN Swiss AG in Mexico City is looking for a Pipeline Integrity Engineer to join its team. This entry-level role offers a unique training opportunity, focusing on post In-Line Pipeline Inspection integrity assessments. Candidates should hold a Bachelor of Science in Engineering... 
    Sugerido

    ROSEN Swiss AG

    Ciudad de México
    21 horas atrás
  •  ...Sr Data Engineer ¡Te estamos buscando! En Grupo Modelo estamos expandiendo nuestro equipo de Data & Analytics, buscando Data Engineers...  ...en el código y facilitar la colaboración. Comprensión de pipelines de integración y despliegue continuo (CI/CD) para automatizar... 
    Sugerido
    Práctica

    Grupo Modelo

    Ciudad de México
    3 días atrás
  •  ...Sobre NTT DATA Haz de este el lugar donde crezcas. Como líder global en innovación empresarial y tecnológica, trabajamos con...  ...del negocio. Participar en el diseño y mantenimiento de pipelines de datos escalables y eficientes. Participar en ceremonias Agile... 
    Sugerido
    Aprendiz
    Contrato
    Desde casa
    Trabajo híbrido

    NTT DATA Europe & Latam

    Miguel Hidalgo, Ciudad de México
    2 días atrás
  •  ...Overview As a Sr. Data Engineer on this team, you will own the design and delivery of complex data pipelines, integrations, and data models that serve as the foundation of our...  ...you will translate top-down architectural decisions into production-grade implementations —... 
    Práctica

    TSSI Recruit US

    Ciudad de México
    21 horas atrás
  •  ...Professional Experience: ~3-5 years of professional experience in data engineering or ETL development roles. ~ Demonstrable experience...  ...: Experience with Control-M for job orchestration and Azure Pipelines for CI/CD. Programming & Tools: Required: Advanced... 
    Práctica

    Inetum

    Ciudad de México
    21 horas atrás
  •  ...leading digital product engineering company that designs...  ...capabilities in Cloud, Data, AI, and CX enable us...  ...Overview : As a Data Engineer at R Systems, you will...  ...solutions, ETL pipelines, and business intelligence...  ...color, religion, national origin, sex, physical or... 
    Tiempo completo

    R Systems

    Ciudad de México
    21 horas atrás
  •  ...Sr. Data Engineer At Onetree, we are looking for a Data Engineer to join our team and play a key role in cloud data migration and...  ...DynamoDB, Redis). Experience building and maintaining ETL/ELT pipelines (DBT, Glue, DMS). Strong problem-solving skills and... 
    Horario flexible

    Onetree

    Ciudad de México
    2 días atrás
  •  ...reliable systems that collect, clean, store and deliver company data so dashboards and reporting work properly — and they lead...  ...it. About the Role Work closely with the Senior Data Engineering Manager and Enterprise Data Architect to turn target-state architecture... 

    Reece USA

    Polanco, Miguel Hidalgo, D.F.
    1 día atrás
  •  ...que moldean el futuro de la industria. Descripción del Puesto Estamos en busqueda de un Data Engineer capaz de diseñar, construir, operar y optimizar pipelines de datos en Snowflake, desde la ingesta hasta la publicación de datos confiables para analítica, BI... 
    Aprendiz
    Práctica

    Motivus

    Ciudad de México
    4 días atrás
  •  ...proprietary AI Studio and AI Engines, the company helps drive the clients...  ...: Remote Full-time Senior Data Engineer Are you an...  ...end-to-end real-time and batch pipelines, and developing cloud-native data...  ...orientation, gender identity, national origin, disability, or any other... 
    Tiempo completo
    Remoto

    Fusemachines

    Ciudad de México
    21 horas atrás
  •  ...ingeniería de software con servicios enfocados en Nube, IA y Data , adoptando un enfoque ágil y centrado en el cliente. Impulsamos...  ...infraestructura como código. Responsabilidades: Data Pipelines & Streaming: Diseño y mantenimiento de arquitecturas de eventos... 
    Aprendiz

    Qualtop

    Ciudad de México
    2 días atrás
  •  ...This is the job In Mexico (CDMX) within the Data & Analytics industry, we are actively seeking a Senior Data Engineer to strengthen our team dedicated to building and optimizing end-to-end data pipelines under a Medallion architecture. Your mission will be... 
    Trabajo híbrido

    Avenga

    Ciudad de México
    1 día atrás
  •  ...Buscamos un(a) Data Engineer & BI Specialist que combine capacidades de ingeniería de datos y business intelligence para diseñar soluciones...  ...la organización, integrando múltiples fuentes, automatizando pipelines y desarrollando dashboards que impacten directamente en... 

    dentsu

    Ciudad de México
    2 días atrás
  •  ...Datos. ~ Deseable experiencia productos y/o servicios del sector financiero. ~ Desarrollo de Datos con Phyton o Java. ~ Uso de Data Lake con Snowflakex, SnowPark, Snowpipe. ~ Manejo de SAS: PROC SQL, macros, Libnames. ~ Dominio de BD: Oracle y SQL Server. ~... 
    Trabajar en la oficina
    Remoto
    Trabajo híbrido

    Minsait

    Álvaro Obregón, Ciudad de México
    2 días atrás
  •  ...Data Engineer | Sistemas Legacy Zona de trabajo: CDMX Modalidad: Híbrida Sobre NTT Data En NTT Data , somos una consultora global líder en tecnología y transformación digital, con presencia en más de 50 países. Nos especializamos en ofrecer soluciones innovadoras... 

    NTT DATA, Inc.

    Ciudad de México
    21 horas atrás
  •  ...desarrollar tus habilidades y contribuir al éxito de un equipo dinámico, ¡Esta es tu oportunidad! Objetivo: Estamos en búsqueda de un Data Engineer Jr apasionado por el análisis de datos y la generación de insights estratégicos para la toma de decisiones. Este rol será clave... 

    Publicis Groupe

    Ciudad de México
    4 días atrás
  • · Rol: AWS Data Engineer · Descripción del puesto: Lic. Sistemas, informática o Afín Experiencia de 2 a 4 años como AWS Data Engineer...  ...usando servicios como AWS Glue, AWS Kinesis Orquestar pipelines de datos con herramientas como AWS Step Functions o Apache... 
    Trabajar en la oficina

    Softtek

    Ciudad de México
    3 días atrás
  •  ...parte de este cambio! Descripción del puesto Buscamos un/a Data Engineer con experiencia en proyectos sobre Google Cloud Platform (GCP...  ...en el análisis de requerimientos, construcción de pipelines ETL/ELT y mejora continua del ecosistema de datos en la nube.... 
    Práctica
    Trabajo híbrido
    Lunes a viernes

    Multiplica

    Ciudad de México
    2 días atrás
  •  ...En Grupo Modelo estamos expandiendo nuestro equipo de Data & Analytics, buscando Data Engineers experimentados que nos ayuden a innovar y a crear ¡un...  ...Objetivo del puesto: Desarrollar, mantener y optimizar pipelines y estructuras de datos que aseguren disponibilidad,... 

    Grupo Modelo

    Ciudad de México
    1 día atrás
  • Company Description Somos un grupo internacional de consultoría digital ágil. En la era de la post transformación digital, nos esforzamos por permitir que cada uno de nuestros 27 000 profesionales se renueve continuamente viviendo de manera positiva su propio flow digital...
    Práctica

    Inetum

    Ciudad de México
    21 horas atrás
  • A global professional services firm seeks a Lead Offshore Pipeline Engineer to guide offshore pipeline engineering projects. The ideal candidate has over 10 years of experience, with a minimum of 5 years in offshore pipeline design and execution. Fluent in both Spanish... 
    Offshore
    Horario flexible

    WorleyParsons

    Ciudad de México
    4 días atrás
  •  ...Overview PepsiCo is hiring a Data Engineer to support its supply chain operations. The role involves building ETL pipelines and collaborating with cross‑functional teams. Responsibilities...  ...based on race, religion, color, national origin, gender, sexual orientation, age, marital... 
    Práctica

    Link-Worldwide

    Miguel Hidalgo, Ciudad de México
    13 horas atrás
  •  ...IA Engineer / Ingeniero de Inteligencia Artificial Habilidades Requeridas: Experiencia: ~+4 años de experiencia en Inteligencia Artificial, Machine Learning o áreas relacionadas. ~ Experiencia sólida en frameworks de IA/ML y en el ciclo completo de desarrollo... 
    Práctica
    Trabajar en la oficina

    Softtek

    Ciudad de México
    4 días atrás
  •  ...que conecten negocio, automatización y datos de forma segura, escalable y eficiente. Actualmente buscamos un(a) Data / Governance Integration Engineer para participar en iniciativas estratégicas de integración, gobierno y adopción de soluciones de IA. Objetivo... 
    Aprendiz
    Ocasional
    Trabajar en la oficina
    Trabajo híbrido

    Mobiik

    Ciudad de México
    2 días atrás
  •  ...Requirements: ~5 years of experience in QA with a specific focus on big data quality assurance and analytics ~ Proficiency with big data...  ...Shell scripting to automate QA tasks ~ Experience with data pipeline tools eg Apache Beam Dataflow Spark ~ Knowledge of Agile... 
    Práctica
    Trabajo por turnos

    LTM

    Ciudad de México
    1 día atrás
  • Tenemos una gran oportunidad para ti como Controller de Recursos Humanos ubicado en CDMX Objetivo Garantizar el control, análisis y optimización de los costos laborales para asegurar que las decisiones de Recursos Humanos sean financieramente sostenibles y alineadas...

    Confidential

    Ciudad de México
    1 día atrás
  • Sobre el equipo El equipo de Datos Maestros de Proveedores es responsable de garantizar la precisión, consistencia, integridad y gobernanza de la información de proveedores en los sistemas Empresariales, para que la organización pueda llevar a cabo actividades financieras...

    Walmart Global Tech

    Ciudad de México
    4 días atrás
  • B Capital in Mexico City is seeking a Senior Lead Data Engineer to design and implement robust data solutions. The role demands at least 8 years of experience, proficiency in Python, SQL, and data engineering fundamentals. You will collaborate across teams, modernize data... 

    B Capital

    Ciudad de México
    1 día atrás

¿Desea recibir más vacantes?

Suscríbase y reciba vacantes similares a Data Pipeline Engineer, Origination Decisions. ¡Sea el primero en aplicar!