Location: Montreal, Quebec
Our client is building a next-generation, AI-native social platform designed with safety and creativity at its core. Their mission is to provide a secure environment where young audiences can create, collaborate, and grow. The client’s next phase is building personalized experiences that evolve with each user’s interests and moods.
Backed by top investors, with a $5million USD Seed, the client is hiring a mission-aligned AI engineer to help power the future of personalized content and community.
What You’ll Do:
•Deliver high-impact backend solutions with extensive hands-on experience across the AWS ecosystem, specifically utilizing ECS, S3, Lambda, EC2, and various managed database services.
•Develop production-grade systems using Node.js and Python, with the technical versatility to apply Rust or C/C++ when optimizing for high-performance requirements.
•Apply expert-level SQL and NoSQL knowledge to manage PostgreSQL, DynamoDB, and Redis environments, prioritizing horizontal scaling and query optimization.
•Execute forward-looking capacity audits and lead the design of durable infrastructure to support long-term organizational growth.
•Partner closely with Machine Learning teams to engineer the foundational pipelines required for large-scale model training and high-availability inference.
•Construct automated data handling workflows that simplify the path from raw data preprocessing and labeling to final model production.
•Architect storage and compute frameworks specifically tuned to minimize latency and cost during the training and deployment phases.
•Advance the maturity of observability platforms by integrating Prometheus and Grafana with unified telemetry data and system-wide health metrics.
•Implement proactive incident detection by configuring smart alerting systems that rely on anomaly patterns and specific performance thresholds.
•Strengthen system hardening by conducting regular penetration testing and simulated breach scenarios to remediate potential security gaps.
•Provide reliable incident oversight during Eastern Time business hours, ensuring rapid restoration of services and the implementation of permanent fixes.
•Engineer specialized SRE AI agents designed to automate routine troubleshooting and streamline complex system monitoring tasks.
Must Have Skills:
•6+ years of professional experience engineering backend systems, with a specific emphasis on the AWS cloud environment.
•Strong command of Node.js and Python for production systems;
•Demonstrate a deep mechanical sympathy for system internals, prioritizing a fundamental understanding of architectural logic over surface-level fixes.
•Expert level experience building and maintaining software services in the AWS ecosystem
•Knowledge of observability practices, and security best practices.
•Experience building or supporting ML/data pipelines is a plus, or proven willingness to learn.
•A B.Sc. in Software engineering or similar
•AWS Solutions Architect Associate or similar is a plus
•Senior-level SQL skills with experience in both relational databases (PostgreSQL) and NoSQL solutions (DynamoDB, Redis), including performance optimization and scaling strategies
Nice to Have Skills:
•Familiarity with C/C++ or Rust for performance-critical components is a plus