Sr. Data Engineer - CANADA (Remote)
Description
Luxury Presence is building the AI growth platform for real estate. Backed by Bessemer Venture Partners and other top investors, we're a Series C company on track to hit $100M in annual recurring revenue in the next six months. More than 87,000 real estate professionals, including over 30% of the WSJ Real Trends top 100 agents in the United States, use us to run and grow their business.
The Opportunity We don't just use AI. We build with it, ship with it, and think with it. We're in the top 1% of companies applying AI effectively, not just to our products, but to how we build them. With an unlimited budget for Anthropic tokens, our engineers use AI agents to write, review, and ship production code every day. We're building toward a world where humans design systems and AI builds them, and we're already further along that path than almost anyone else. As a Sr. Data Engineer, you'll be a technical leader at the center of this transformation. You'll shape platform architecture, drive AI-powered product delivery, and raise the bar for what a small, AI-augmented engineering team can accomplish. This is a role for someone who already lives in their terminal with AI, who has strong opinions about how AI changes system design, and who wants to help define what a next-generation engineering organization looks like.
What You'll Do
Build and scale high-throughput streaming pipelines. Design, implement, and operate pipelines ingesting 400M+ monthly MLS updates across 350+ integrations using Airflow, Spark Streaming, Kafka, and Iceberg—ensuring reliability, performance, and data correctness. Model and deliver high-quality, production-grade real estate datasets. Develop and maintain datasets that power core product experiences, with a focus on data modeling, transformation logic, and balancing freshness, accuracy, and cost. Strengthen data quality and observability. Implement and improve data quality checks, monitoring, and alerting to detect issues early and reduce downstream impact. Leverage AI to improve data operations. Contribute to AI-driven tooling that helps triage, debug, and resolve data quality issues, increasing team efficiency and reducing manual intervention.
What We're Looking For
Attributes You already build with AI daily. You use Claude Code as a core part of your workflow, not as a novelty You have strong opinions, loosely held, about how AI changes software architecture, team structure, and engineering culture You think in systems. You connect technical decisions to customer outcomes and long-term business value You communicate clearly and directly. You can explain complex tradeoffs to product, design, and executive stakeholders You're energized by ambiguity and speed. You thrive in a fast-growing company where the roadmap evolves and ownership is real You like to have fun at work. We take our craft seriously, but we don't take ourselves too seriously. We celebrate wins, crack jokes, and genuinely enjoy building together
Skills and Experience 6+ years of professional data engineering or software engineering experience Strong experience with distributed data processing and streaming systems (Spark / PySpark, Kafka) Proficiency in Python (Pydantic preferred) and familiarity with Node/TypeScript is a plus Experience building and maintaining data pipelines on AWS using tools like Airflow, Spark Streaming, and Iceberg Solid understanding of data modeling and working with large-scale datasets Familiarity with event-driven systems and ingestion patterns (Kafka, SQS) Experience implementing data quality checks, monitoring, and debugging data issues Interest in applying AI/ML or automation to improve data workflows is a plus Proven track record leading high-impact initiatives from concept through production in a SaaS environment Expert-level grasp of software design principles and experience with multi-tenant platform architectures
Tech Stack
Backend
Python, PySpark, Pydantic, Node/TypeScript
Data
Iceberg, Postgres
Infrastructure
AWS, Kubernetes, Airflow, Spark Streaming
Messaging
Kafka, SQS