In God we trust, all others bring data.
Hello, I am Lulu, I work on data and technology.
I am an abstract thinker, both mathematically and philosophically. I build and
architect distributed systems for data and AI/ML applications. I enjoy good
design with technology, always look for ways to make the world a better place.
tl;dr
Graduated with MS in Computer Science specialized in machine learning and AI,
MA in Economics and BA in Political Science.
Currently I am software engineer at Block. Previously I was master data engineer at Capital One, senior
data engineer at Deloitte building streamlined and automated machine
learning pipeline to shorten data science's experiment to production time.
While not building new stuff, I enjoy thinking how to better software and
infrastructure by design and leverage/re-purpose/combine new
technologies to help organizations and people do things better and quicker.
Data has always been the center of my interest. Over past 7 years, I've worked
with data in petabyte scale on 200 nodes cluster and few thousand records
survey data in pandas, both exploratory analysis and production grade ETL
architecture design and development.
I've run what people used to call 'statistical' analysis on survey data as
well as large scale distributed machine learning algorithms on sales data.
I've worked with various business functions - customer service, supply chain,
manufacturing, market research, politics, health care to develop customized BI
solutions from infrastructure to ETL pipelines to dashboards. I've built in
d3.js
(free) as well as tableau ($$$$) -- with a tiny team of 5
to enterprise with teams across 5 continents.
I am always excited about new data problems everyday whether that's
infrastructure or application or analytics to support product Technology is
just moving so fast, no two days are the same.
- to Computers
Scala, Python, Go, Java, AWS, Bash
- to Data
Hive, Presto, Spark, SQL, R, Matlab, SPSS, SAS, STATA
- to Web Browsers
html, css, javascript, python-flask, scala-play, d3.js, dc.js
- to Real People
English and Mandarin
Where I Learn ¶
- I hold an MS in Computer Science from Gerogia Tech, where I focus on Machine Learning and AI.
- Prior to this, I graduated with MA in Economics from Syracuse University (NY), where I specialized in Econometrics in 2013. I’ve had my BA in Political Science from National Chenchi University (Taipei, Taiwan) back in 2012.
Where I Work ¶
Currently I am Software Engineer at Square.
Prior to that, I was Master Data Engineer at Capital One (New York, NY) working on Identity Verification, Fraud and ML Platform.
Prior to that, I was Senior Data Engineer at Deloitte Digital (New York, NY), where I …
- Build integrated and automated AI/ML infrastructure and toolings to simplify and shorten data science’s experiment to production cycle.
- Research, evaluate, implement and contribute open source AI/ML toolings. Advocate for adoptions and coordinate migration process across group-wide teams to improve overall quality and efficiency of AI/ML applications.
Prior to that, I was Software Engineer at PeerIQ (New York, NY), where I …
- Breaking legacy system into more microservice and container-based architecture. Swapping big-boxes with more distributable, flexible serverless infrastructure with minimal code changes.
- Building big data applications and systems to analyze hundreds of billions of consumer credit records in spark, scala, hive, presto or whatever that works
Prior to that, I was data analyst at McGraw Hill Education (New York, NY), where I found with data that
- Business students are more likely to take science courses than science students to business courses.
- How many days our customer on average have to wait for their orders globally.
Prior to that, I was Statistical Analyst at Radius Global Market Research (New York, NY), where I design and develop experiments on web to collect data that
- Show consumer segment that only buys coke when it’s comboed with root beer
- Help dogs and cats get adopted by optimizing their profile photo
I do Pro-Bono work, all the time! ¶
I worked with Datakind (New York, NY) to
- help Threshold to setup analytical data warehouse and build dashboard with flask+d3.
In 2014, I worked on UN OCHA's Humanitarian Data Exchange to
- Text scraping ten years worth of UN’s humanitarian documents.