
Journeys to Democratize Data+AI
By Sandeep Uttamchandani
Our mission with this podcast is to share knowledge and experiences so the power of Data+AI can create a data-driven world providing equal opportunities for everyone!

Journeys to Democratize Data+AI Dec 11, 2021

Sandeep's Quicktake: Getting teams to not just look at wrong labels in ML data
Democratize is not one big strategy but 100s of small things that you put in place. Quicktake is a series of short practical tips to get you started towards making data and AI widely accessible and self-service within your organization.
This tip covers an important myth: To improve model accuracy, start by verifying the correctness of labels. Typically, there is only a small percentage of miss predictions that are related to wrong labels. Often times the biggest reason for model inaccuracies is the poor quality of data samples. Train the team just that instead of jumping to fix the incorrect labels, start by analyzing a sample of results that were misclassified and make a judgment call on whether to invest in fixing the labels going back and looking at opus deposit useful.

Sandeep's Quicktake: How to handle misclassified predictions
Democratize is not one big strategy but 100s of small things that you put in place. Quicktake is a series of short practical tips to get you started towards making data and AI widely accessible and self-service within your organization.
This tip covers 3 recipes on handling misclassified ML predictions within your product.

Journey to democratize AI for digital agriculture
In this episode, my guest is Daniel Mccaffery. Daniel is a technology thought leader driving Data and Analytics at Climate Corporation (a division of Bayer).
Daniel shares his insights on using ML to provide personalized recommendations for helping farmers grow crops with higher yield, profitable and sustainability. This involves deciding the right seed, right crop protection, density levels across different parts of the farm, etc. This is a fascinating example of AI and physical sciences coming together to build an innovative product offering. Daniel and I had a blast covering several topics: the building of models, model deployment and re-training, explainability for farmers to understand the recommendations, managing bias, experimentation A/B testing, monitoring drifts, data labeling, and perspectives on key bottlenecks in going from idea to ROI.

Democratizing Data Governance with Data Products at ING Bank France
In this episode, Samir Boualla shares the journey to democratize Data Governance across business teams (with Data Literacy & Data Protection). He also discusses how they build internal and external-facing data products to expedite the journey to self-service Data.
At ING Bank France, Samir is the Chief Data Officer responsible for several teams governing, developing, and managing data infrastructure and data assets to deliver value to the business. With over 20+ years of experience on various data topics. Samir shares interesting battle-tested techniques in this podcast: a process catalog, having a "data minimum standard," change management mindset, applying transfer learning.

Democratizing Data Quality at LinkedIn - Part 2
In this episode (part 2), Kapil Surlaker shares the journey to democratize data quality at LinkedIn scale!
Kapil has 20+ years of experience in data and infrastructure both at large companies such as Oracle as well as multiple startups. At LinkedIn, Kapil has been leading the next generation of Big Data Infrastructure, Platforms, Tools, and Applications to empower Data Scientists, AI engineers, App developers, to extract value from data. Kapil's team has been at the forefront of innovation driving multiple open source initiatives such as Apache Pinot, Gobblin, DataHub.

Democratizing metadata management and data access APIs at LinkedIn - Part 1
In this episode (Part 1), Kapil Surlaker shares the challenges and experiences in successfully democratizing metadata management and data access APIs across LinkedIn. In the next part, we deep-dive on data quality.
Kapil is the VP of Engg, and Head of Data at LinkedIn. He has 20+ years of experience in data and infrastructure both at large companies such as Oracle as well as multiple startups. At LinkedIn, Kapil has been leading the next generation of Big Data Infrastructure, Platforms, Tools, and Applications to empower Data Scientists, AI engineers, App developers, to extract value from data. Kapil's team has been at the forefront of innovation driving multiple open source initiatives such as Apache Pinot, Gobblin, DataHub.

Journey to recruit and build Data Science at Etsy
In this episode, Chu-Cheng covers experiences in recruiting and building a Data Science team from scratch.
Chu-Cheng is the Chief Data Officer (CDO) at Etsy. Chu-Cheng leads the global data organization responsible for data science strategy, AI innovation, machine learning & data infrastructure. Prior to Etsy, Chu-Cheng led various data roles at Amazon, Intuit, Rakuten, and eBay. Chu-Cheng is a Ph.D. in computer science, with published papers in key AI/ML conferences.

Using Data Science to democratize traditional market data analysis at PFM
In this episode, Manish Chitnis covers experiences in using Data Science to democratize traditional market data analysis (for hedge fund investment decisions).
Manish Chitnis is the Chief Data Officer (CDO) at Partner Fund Management (PFM). Manish has 20+ years of diverse multi-disciplinary experience across a wide range of analytics: architecting the data warehouses from scratch, building risk/data apps, introducing new data architectures, instituting data governance/stewardship, data-hygiene and cleanup, improved data collection, and much more.

Democratize by developing data literacy and standardized metrics at Tailored Brands
In this podcast, Meenal Iyer shares the journey to democratize by growing data literacy, creating a data-driven culture, and standardization of business metrics.
Meenal Iyer is a Data and Analytics leader at Tailored Brands. She brings in 20+ years of data analytics experience across multiple domains namely retail, travel, financial services. Meenal has been transforming enterprises to become data driven, and shares interesting domain agnostic lessons from her experience.

Journey to develop a self-service data platform in FinTech
In this episode, Keyur Desai shares the journey of building a data strategy & a pervasive self-service analytics platform. He discusses some really valuable lessons based on his extensive experience.
Keyur Desai is the former CDO of TD Ameritrade. He is a data executive with over 30 years of experience managing and monetizing data and analytics. He has created data-driven organizations and driven enterprise-wide data strategies, data literacy, modern data governance, machine learning & data science, pervasive self-service analytics, and several other initiatives. He has experience across multiple industries including Insurance, Technology, Healthcare, Retail.

Journey to democratize business metrics & experimentation at Intuit
In this episode, Anil Madan shares the journey at Intuit as they democratize business metrics and experimentation within the company. Anil is the VP of Data and Analytics for Intuit’s Small Business and Self Employed group. He has over 25 years of experience in the data space across Intuit, PayPal, eBay — a pioneer in building data infrastructure and value creation in the form of products, experimentation, data advertising, digital marketing, payments, and many more.