About

 Full Resume

In God we trust, all others bring data.
The Elements of Statistical Learning

Hello, I am Lulu, I work on data and technology.


I am an abstract thinker, both mathematically and philosophically. I build and architect distributed systems for data and AI/ML applications. I enjoy good design with technology, always look for ways to make the world a better place. tl;dr

Graduated with MS in Computer Science specialized in machine learning and AI, MA in Economics and BA in Political Science. Currently I am software engineer at Block. Previously I was master data engineer at Capital One, senior data engineer at Deloitte building streamlined and automated machine learning pipeline to shorten data science's experiment to production time.

While not building new stuff, I enjoy thinking how to better software and infrastructure by design and leverage/re-purpose/combine new technologies to help organizations and people do things better and quicker.

Data has always been the center of my interest. Over past 7 years, I've worked with data in petabyte scale on 200 nodes cluster and few thousand records survey data in pandas, both exploratory analysis and production grade ETL architecture design and development.

I've run what people used to call 'statistical' analysis on survey data as well as large scale distributed machine learning algorithms on sales data. I've worked with various business functions - customer service, supply chain, manufacturing, market research, politics, health care to develop customized BI solutions from infrastructure to ETL pipelines to dashboards. I've built in d3.js (free) as well as tableau ($$$$) -- with a tiny team of 5 to enterprise with teams across 5 continents.

I am always excited about new data problems everyday whether that's infrastructure or application or analytics to support product Technology is just moving so fast, no two days are the same.

Languages I Speak

  • to Computers
    Scala, Python, Go, Java, AWS, Bash
  • to Data
    Hive, Presto, Spark, SQL, R, Matlab, SPSS, SAS, STATA
  • to Web Browsers
    html, css, javascript, python-flask, scala-play, d3.js, dc.js
  • to Real People
    English and Mandarin

Where I Learn

  • I hold an MS in Computer Science from Gerogia Tech, where I focus on Machine Learning and AI.
  • Prior to this, I graduated with MA in Economics from Syracuse University (NY), where I specialized in Econometrics in 2013. I’ve had my BA in Political Science from National Chenchi University (Taipei, Taiwan) back in 2012.

Where I Work

Currently I am Software Engineer at Square.

Prior to that, I was Master Data Engineer at Capital One (New York, NY) working on Identity Verification, Fraud and ML Platform.

Prior to that, I was Senior Data Engineer at Deloitte Digital (New York, NY), where I …

  • Build integrated and automated AI/ML infrastructure and toolings to simplify and shorten data science’s experiment to production cycle.
  • Research, evaluate, implement and contribute open source AI/ML toolings. Advocate for adoptions and coordinate migration process across group-wide teams to improve overall quality and efficiency of AI/ML applications.

Prior to that, I was Software Engineer at PeerIQ (New York, NY), where I …

  • Breaking legacy system into more microservice and container-based architecture. Swapping big-boxes with more distributable, flexible serverless infrastructure with minimal code changes.
  • Building big data applications and systems to analyze hundreds of billions of consumer credit records in spark, scala, hive, presto or whatever that works

Prior to that, I was data analyst at McGraw Hill Education (New York, NY), where I found with data that

  • Business students are more likely to take science courses than science students to business courses.
  • How many days our customer on average have to wait for their orders globally.

Prior to that, I was Statistical Analyst at Radius Global Market Research (New York, NY), where I design and develop experiments on web to collect data that

  • Show consumer segment that only buys coke when it’s comboed with root beer
  • Help dogs and cats get adopted by optimizing their profile photo

I do Pro-Bono work, all the time!

I worked with Datakind (New York, NY) to

  • help Threshold to setup analytical data warehouse and build dashboard with flask+d3.

In 2014, I worked on UN OCHA's Humanitarian Data Exchange to

  • Text scraping ten years worth of UN’s humanitarian documents.