Machine Learning Engineer

Proof • Full-time • Remote (Canada, CA) • 2m ago

We're Proof, a fast-growing startup in the legal tech industry. Our best-in-class legal services platform is used by thousands of law firms throughout the US, and we've recently raised our Series B from top-tier investors. We're looking for a talented Machine Learning Engineer to join our world-class team and drive the application of AI and ML to make our operations more efficient. This is an exciting opportunity to establish a new job function at a rapidly scaling company and work closely with our CTO and VP of Product.

At Proof, we believe in the power of experimentation and data-driven decision making. While we don't see AI as a silver bullet, we are convinced that our unique position in the legal industry provides us with a real use case where this technology can drive immense business value and become our competitive advantage. Our extensive dataset of legal documents and deep understanding of the legal domain puts us in a prime position to harness AI and ML to streamline our operations and provide unparalleled service to our clients.

As our first Machine Learning Engineer, you will play a pivotal role in defining and refining our AI strategy from its inception. You'll tackle intricate challenges, help shape the tools and techniques we employ daily, and witness firsthand how your contributions directly influence our business outcomes and customer experience. If you're passionate about applying your expertise to real-world problems and being at the forefront of AI innovation within the legal sector, this role offers an ideal platform for your career growth.

What you’ll do:

Develop and optimize NLP, NER, and classification models to automatically extract information from legal documents, reducing manual transcription
Collaborate with engineering to build data pipelines and deploy, monitor, and maintain ML models into our production environments using tools such as Jupyter Notebook, Docker, and Kubernetes.
Conduct statistical analysis, including descriptive statistics and predictive modeling, on our business data to generate actionable insights and guide pricing experiments using tools like Pandas, NumPy, and Scikit-Learn.
Perform outlier analysis to identify and handle anomalies in our datasets, ensuring data quality and model robustness.
Create informative data visualizations to communicate complex findings and aid in decision-making processes using libraries like Matplotlib, Seaborn, and Plotly.
Utilize probabilistic graphical models and Bayesian statistical modeling techniques where appropriate to model uncertainty and make probabilistic predictions.
Fine-tune and adapt existing ML models for our specific use cases, balancing performance and cost-effectiveness.
Maintain and optimize our data warehouse and ETL pipelines to ensure data quality and availability (we use Airbyte and BigQuery).
Stay updated with the latest advancements in ML and data science research, applying relevant techniques to enhance our solutions.

What we’ll expect you to know on day one:

5+ years of professional experience working in data science, machine learning, or a related field
Strong proficiency in Python and experience with relevant NLP & deep learning libraries (e.g. TensorFlow, PyTorch, spaCy)
Solid understanding of ML techniques and experience with tasks like information extraction and document/text classification.
Ability to perform descriptive and predictive statistical analysis, outlier analysis, and data visualizations using tools like SQL, Spark, and Hadoop.
Experience with development and deployment tools such as Git, Jenkins, and AWS/GCP/Azure.
Familiarity with probabilistic graphical models and Bayesian statistical modeling.
Familiarity with agile development methodologies and collaborative tools like Jira or Linear.
Excellent communication skills and ability to collaborate cross-functionally.
Self-starter mindset and excitement about applying ML to real-world business problems.

Great to have, but not required:

Advanced degree in Computer Science, Statistics, or a related quantitative field
Familiarity with optimization techniques to balance the needs of multi-sided marketplaces
Expertise using OCR to extract data from large PDF documents
Previous experience working on a Marketplace tech product

Compensation and Benefits:

100% remote, work from anywhere in Canada
Flexible paid time off and holidays
Equipment provided
Health care, vision, dental, disability insurance, and 401K options