Shifting the Pareto Frontier of Enterprise AI with Automated Prompt Optimization

September 28, 2025 in Machine Learning

This week I published a research blog post with Databricks on how we shift the Pareto frontier of enterprise agents using automated prompt optimization.

Joining Databricks Mosaic

December 02, 2024 in Personal Growth

I am thrilled to be joining the Databricks Mosaic team to work on applied AI research. This is a fantastic opportunity for me to continue pushing the boundaries of AI advancement and contribute to building a successful AI platform for enterprises.

From Vienna to Paris: A Collection of My Favorite Travel Photos in Europe

December 01, 2024 in Journal

My girlfriend and I recently traveled to Europe for a conference as well as a vacation. We had a great time walking and exploring each of the cities that we visited. In this post, I am sharing my favorite photos captured in each city.

My Ray Summit Talk - Building Multi-Modal Foundation Models for Document Automation

November 03, 2024 in Machine Learning

I gave a presentation at the Ray Summit on my work building Multimodal Foundation Models for Document Automation at Uber. It is always a great pleasure to publicly share what I have been building over the past year!

Release of Levanter 1.0

June 16, 2023 in Machine Learning

Today at Stanford, we released Levanter, a JAX-based framework for training foundation models. It is now open source on GitHub under the Apache 2.0 license.

Things I Learned at Landing AI

April 14, 2023 in Personal Growth

Over the past four and a half years at Landing AI, I have had the incredible opportunity to work with Andrew Ng, Dillon Laird, and other amazing people to build AI applications across various industries. Each project has brought its own challenges, pushing me to dive deeper into the ever-evolving world of AI. As I look back on this enriching journey, I am grateful and humbled to share the lessons I've learned, in the hope of inspiring others in the field.

Fast and Simple Image Search with Foundation Models

March 19, 2023 in Machine Learning

In this blog post, I will walk you through how to build a fast and simple image search tool. I developed an image search application that uses multimodal foundation models to return highly accurate and relevant results. By following this blog post and our code base, you can easily build one yourself!

The Making of LandingLens AI Platform: Motivation and My Favorite Features

March 04, 2023 in Product

Last week at Landing AI, we publicly launched our flagship AI platform, LandingLens. This all-in-one platform empowers users to build a computer vision application from start to deployment. In this blog post, I want to share the motivation behind building this AI platform and highlight a few key features that I truly enjoy!

Paper Explained - LAION-5B

December 04, 2022 in Machine Learning

In this blog post, I cover one of the award-winning papers at NeurIPS 2022. The paper presents LAION-5B, a dataset of 5.85 billion image-text pairs that further pushes the scale of open datasets for training and studying state-of-the-art language-vision models. Training at this scale yields strong gains in zero-shot transfer and robustness.

Build an Automated Cross-Domain Question Answering System

April 23, 2022 in Machine Learning

Question Answering models are often used to automate the response to human questions by leveraging a knowledge base. My team at Stanford aims to build a robust question answering system that works across datasets from multiple domains. We explore two transformer-based Sparsely-Gated Mixture-of-Experts architectures and conduct an extensive ablation study to reach the best performance.

The Importance of Metrics in Machine Learning and How to Use Them

February 05, 2022 in Machine Learning

Metrics are critical in machine learning projects. They help a team prioritize its resources and concentrate on a single, clear objective. I am always amazed by the speed and momentum with which my team executes once we are aligned on a single metric to optimize. In the end, we are usually able to accomplish goals that seemed impossible at the beginning.

Model Training with Machine Learning

August 23, 2021 in Machine Learning

Based on our past experience at Landing AI, we have developed best practices for model training and evaluation. In this article, I share a few high-priority tasks during model training. We openly share our guiding principles to help machine learning engineers (MLEs) through model training and evaluation.

Data Labeling of Images for Supervised Learning

August 23, 2021 in Machine Learning

At Landing AI, we observed that many projects took an unnecessarily long and painful path to completion because of ambiguous defect definitions or poor labeling quality. In comparison, a dataset with high-quality labels makes the life of machine learning engineers much easier and the whole project lifespan much shorter. Therefore, it is very important to invest time in the project’s early stage to clarify defect definitions and formalize labeling.

Data Validation for Machine Learning - Paper Reading Note

August 22, 2021 in Machine Learning

This paper reminds me of the many times when our models in production behaved strangely, and engineers had to spend hours investigating root causes and then roll back or push fixes. Many late nights of work resulted from such mistakes. I agree with the paper that data validation systems, if implemented correctly, can save a significant number of engineering hours by catching important errors proactively and diagnosing model errors more efficiently.

Designing Image Acquisition for Machine Vision

January 25, 2021 in Machine Learning

At Landing AI, I have gone through several projects where we developed an end-to-end machine learning system from “scratch”. That means that before we started the project, there was no existing data collection procedure, so we had to start from zero and set up cameras.

Apple Fitness+

December 23, 2020 in Journal

I am looking forward to a more radical redesign of gym spaces that integrates well with these virtual workout services and elevates the users’ workout experience.

Two Types of Full Stack Machine Learning Engineering

October 05, 2020 in Machine Learning

There are two types of Full Stack Machine Learning Engineering in my mind: one vertical and one horizontal.

My First Week of CS330 at Stanford

September 21, 2020 in Machine Learning

Meta Learning is one of the promising lines of work that aim to solve the small data problem in the machine learning field. Currently, many people working on AI are thinking day and night about how to scale AI systems and improve their profit margins. One main challenge is how to quickly build an AI model that reaches human-level performance on classes with only a few samples.

Tech Talk on Photography

September 13, 2020 in Machine Learning

I gave this tech talk at Landing AI this week and I’d like to share it on my website. It covers the core concepts in photography and touches on the recent trend of computational photography.

I removed a section about the imaging solution design in one of our internal projects due to IP restrictions. I will share a blog post on that topic later.

Written Languages

August 21, 2020 in Journal

Today, I read the “memory overload” section in “Sapiens”, where I learned that the first few generations of written languages were developed for bookkeeping, specifically the accounting and taxation of ancient empires.