About Algebra

Algebra builds and operates AI-powered workflows for mid-market companies as a managed service. We identify high-value operational processes, design AI systems to run them, and own the outcome end to end.

We are not consultants. We do not sell software licenses. We are the accountable operator.

The Role

This is a senior data engineering role for someone who can turn messy business data into the foundation for reliable AI workflows.

Algebra’s agents are only as good as the data layer underneath them. You will build the pipelines, models, integrations, and data infrastructure that allow our systems to understand client operations, surface the right context, and take action inside real business processes.

You need to be strong enough technically to design the data architecture, but practical enough to work inside imperfect client environments. You should be comfortable moving between data pipelines, warehousing, APIs, orchestration, transformation logic, data quality, governance, and production debugging.

This is a builder role. You will help define the data patterns, standards, and infrastructure Algebra uses as it scales from custom client deployments into repeatable systems.

If your instinct is to find the truth inside messy data, structure it properly, and make it useful for systems that actually run the business, this is the role.

What You'll Do

Data Infrastructure and Pipelines

Build production-grade data pipelines that support Algebra’s AI-powered workflows
Design data ingestion patterns across client systems, APIs, databases, spreadsheets, documents, and third-party tools
Build transformation logic that turns fragmented operational data into usable workflow context
Own data movement across staging, production, and client-specific environments
Set up orchestration, scheduling, monitoring, and failure handling for critical data flows
Create reusable data pipeline patterns for repeatable client deployments

Data Modeling and Workflow Context

Design data models that represent real business workflows, entities, rules, and operational states
Build the structured data layer that allows AI agents to reason over client operations
Translate messy business processes into clean schemas, relationships, and workflow-ready data structures
Support use cases such as document intake, reconciliation, approvals, reporting, exception handling, and operational alerts
Work closely with application and AI engineers to make sure data is usable inside actual product experiences
Build systems that preserve context, lineage, and auditability where required

Integrations and Data Sources

Connect to client systems, CRMs, ERPs, databases, finance tools, communication platforms, and document repositories
Build reliable API integrations, sync jobs, webhooks, and ingestion workflows
Handle authentication, permissions, data mapping, schema changes, and integration failure states
Work with unstructured and semi-structured data, including PDFs, documents, emails, spreadsheets, and extracted fields
Debug production data issues across pipelines, APIs, databases, and client systems
Make data integrations repeatable instead of rebuilding every connection from scratch

Data Quality, Governance, and Reliability

Build checks for data quality, completeness, freshness, duplication, and consistency
Set up monitoring and alerting for data pipeline failures and data reliability issues
Create practical data governance standards for client deployments
Support secure handling of sensitive operational, financial, and client data
Maintain clear data lineage, audit trails, and access control where needed

What We’re Looking For

10+ years of experience in data engineering, analytics engineering, platform data engineering, or backend data systems
You have built and operated production data pipelines used by real systems, not just dashboards
Strong experience with SQL, data modeling, ETL/ELT, orchestration, APIs, and cloud data infrastructure
Experience with modern data warehouses, databases, and transformation frameworks
You understand how to work with messy operational data, not only clean analytics datasets
You can design data models that reflect real business entities, processes, rules, and exceptions
You are comfortable working with structured, semi-structured, and unstructured data
You understand data quality, lineage, access control, monitoring, and production reliability
You can debug complex issues across source systems, pipelines, transformations, APIs, and application logic

Bonus Points

You have built data infrastructure for AI, automation, LLM, or agent-based systems
You have experience with vector databases, embeddings, retrieval systems, or knowledge graphs
You have worked with document processing, OCR, data extraction, reconciliation, or human-in-the-loop review workflows
You have built integrations with systems like Salesforce, HubSpot, Zoho, Microsoft 365, Google Workspace, Slack, Teams, SharePoint, NetSuite, SAP, Oracle, or finance/ERP tools
You have experience with dbt, Airflow, Dagster, Prefect, Kafka, Snowflake, BigQuery, Redshift, Postgres, or similar tools
You understand multi-tenant data architecture, role-based access control, audit logs, and client-specific data separation

What This Role Is Not

This is not a reporting or dashboard-only role. The data layer you build will power real AI workflows and operational systems.

This is not a clean-room analytics role where every source is documented and every table is reliable. You will work with messy, incomplete, inconsistent client data.

This is not a role where someone else defines the data architecture and you only implement tickets. You will be expected to form a view and create the standards Algebra scales on.

If your best work requires a mature data platform, a large team, and stable requirements, this is probably not the right fit. If you want to build the data backbone of an AI operations company from the ground up, we want to talk.

About Algebra

We are not consultants. We do not sell software licenses. We are the accountable operator.

The Role

This is a senior data engineering role for someone who can turn messy business data into the foundation for reliable AI workflows.

This is a builder role. You will help define the data patterns, standards, and infrastructure Algebra uses as it scales from custom client deployments into repeatable systems.

If your instinct is to find the truth inside messy data, structure it properly, and make it useful for systems that actually run the business, this is the role.

What You'll Do

Data Infrastructure and Pipelines

Build production-grade data pipelines that support Algebra’s AI-powered workflows
Design data ingestion patterns across client systems, APIs, databases, spreadsheets, documents, and third-party tools
Build transformation logic that turns fragmented operational data into usable workflow context
Own data movement across staging, production, and client-specific environments
Set up orchestration, scheduling, monitoring, and failure handling for critical data flows
Create reusable data pipeline patterns for repeatable client deployments

Data Modeling and Workflow Context

Design data models that represent real business workflows, entities, rules, and operational states
Build the structured data layer that allows AI agents to reason over client operations
Translate messy business processes into clean schemas, relationships, and workflow-ready data structures
Support use cases such as document intake, reconciliation, approvals, reporting, exception handling, and operational alerts
Work closely with application and AI engineers to make sure data is usable inside actual product experiences
Build systems that preserve context, lineage, and auditability where required

Integrations and Data Sources

Connect to client systems, CRMs, ERPs, databases, finance tools, communication platforms, and document repositories
Build reliable API integrations, sync jobs, webhooks, and ingestion workflows
Handle authentication, permissions, data mapping, schema changes, and integration failure states
Work with unstructured and semi-structured data, including PDFs, documents, emails, spreadsheets, and extracted fields
Debug production data issues across pipelines, APIs, databases, and client systems
Make data integrations repeatable instead of rebuilding every connection from scratch

Data Quality, Governance, and Reliability

Build checks for data quality, completeness, freshness, duplication, and consistency
Set up monitoring and alerting for data pipeline failures and data reliability issues
Create practical data governance standards for client deployments
Support secure handling of sensitive operational, financial, and client data
Maintain clear data lineage, audit trails, and access control where needed

What We’re Looking For

10+ years of experience in data engineering, analytics engineering, platform data engineering, or backend data systems
You have built and operated production data pipelines used by real systems, not just dashboards
Strong experience with SQL, data modeling, ETL/ELT, orchestration, APIs, and cloud data infrastructure
Experience with modern data warehouses, databases, and transformation frameworks
You understand how to work with messy operational data, not only clean analytics datasets
You can design data models that reflect real business entities, processes, rules, and exceptions
You are comfortable working with structured, semi-structured, and unstructured data
You understand data quality, lineage, access control, monitoring, and production reliability
You can debug complex issues across source systems, pipelines, transformations, APIs, and application logic

Bonus Points

You have built data infrastructure for AI, automation, LLM, or agent-based systems
You have experience with vector databases, embeddings, retrieval systems, or knowledge graphs
You have worked with document processing, OCR, data extraction, reconciliation, or human-in-the-loop review workflows
You have built integrations with systems like Salesforce, HubSpot, Zoho, Microsoft 365, Google Workspace, Slack, Teams, SharePoint, NetSuite, SAP, Oracle, or finance/ERP tools
You have experience with dbt, Airflow, Dagster, Prefect, Kafka, Snowflake, BigQuery, Redshift, Postgres, or similar tools
You understand multi-tenant data architecture, role-based access control, audit logs, and client-specific data separation

What This Role Is Not

This is not a reporting or dashboard-only role. The data layer you build will power real AI workflows and operational systems.

This is not a clean-room analytics role where every source is documented and every table is reliable. You will work with messy, incomplete, inconsistent client data.

This is not a role where someone else defines the data architecture and you only implement tickets. You will be expected to form a view and create the standards Algebra scales on.

Sr. Data Engineer

Job Description

The Role

What You'll Do

Apply now

Stay at the forefront
of market

Sr. Data Engineer

Job Description

The Role

What You'll Do

Apply now

Sr. Data Engineer

Job Description

The Role

What You'll Do

Apply now

Stay at the forefront of market

Sr. Data Engineer

Job Description

The Role

What You'll Do

Apply now

Stay at the forefront
of market