Do most ML projects fail?
You have probably heard that around 85% of machine learning projects fail. That statistic comes from a 2018 Gartner report (press release) that contained the following: “Gartner predicts that through 2022, 85 percent of AI projects will deliver erroneous outcomes due to bias in data, algorithms or the teams responsible for managing them.”[1] For the last 38 weeks (during the course of publishing this Substack) I have been talking about the necessity of having a top-level machine learning strategy that prioritizes use cases based on return on investment.[2] You have a ton of pundits and other pontificators giving guidance on how to make your machine learning project successful.[3] To me it is about having a machine learning strategy and executing in a planful way that follows a definable and repeatable path to success.
We can fast-forward a few years and catch up to the present with a newer report the Gartner teams shared: “Gartner research shows only 53% of projects make it from artificial intelligence (AI) prototypes to production. CIOs and IT leaders find it hard to scale AI projects because they lack the tools to create and manage a production-grade AI pipeline.”[4] I guess that means the trajectory of success has improved in the last few years, but the Gartner team’s underlying message that a large share of ML projects will fail remains consistent. This is one of the reasons I spend so much of my time, including speaking appearances, talking about open source MLOps efforts. That is where I see the most people working together to overcome the obstacles described above.
Q: What do you do to help prevent your ML projects from failing?
A: Use the patterns that are known to work. Full stop.
A lot of machine learning use cases have been demonstrated to be effective. Start out by targeting and deploying similar use cases. That is the best method of getting into the machine learning ecosystem and probably getting a solid return on investment from your ML strategy. Doing something definable and repeatable against an external API that has proven durable and dependable is always easier than trying to do something unique within your organization. Those unique use cases could be really rewarding, but they are going to be hard to drive toward success. To that end, people who are just looking to access a machine learning API from AWS, Azure, or GCP are going to find a much easier path to use case success. All of the work to train, implement, and continuously improve the machine learning models is handled by the provider. You also get the benefit of having many, many users pressure-testing the API vs. being siloed into a single-service use case.
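To make that concrete, here is a minimal sketch of what calling one of those managed APIs can look like, using AWS Comprehend’s sentiment endpoint through boto3 as one possible example. The region, credentials setup, and sample text are assumptions for illustration, not a recommendation of any particular provider:

```python
# A minimal sketch of calling a managed ML API: AWS Comprehend sentiment
# analysis through boto3. Assumes boto3 is installed and AWS credentials
# are already configured; the region and sample text are illustrative.
import boto3

comprehend = boto3.client("comprehend", region_name="us-east-1")

response = comprehend.detect_sentiment(
    Text="The rollout went smoothly and the team is happy with the results.",
    LanguageCode="en",
)

# The provider trains, hosts, and continuously improves the model;
# the caller only consumes the prediction.
print(response["Sentiment"])       # e.g. "POSITIVE"
print(response["SentimentScore"])  # confidence scores per sentiment label
```

The design point is that all of the model training, hosting, and improvement happens behind that one call, which is exactly why this path to a first use case is so much easier than building something unique in-house.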
Links and thoughts:
This video from Google Cloud Tech’s #VMEndToEnd series was pretty good this week, “How to save money with VMs”
This video from Google Cloud Tech’s #ArchitectingCloudSolutions series was also decent this week, “How to architect a no-code ML platform on Google Cloud”
From Microsoft Developer on YouTube, “AI Show Live - Episode 34 Introduction to Deep Learning”
While editing and extending this Substack missive, I listened to the WAN Show with Linus and Luke, “I Have MORE to Say About Steam Deck - WAN Show October 8, 2021”
Top 5 Tweets of the week:
Footnotes:
[2] and pretty much every talk I have given in the last 3 years
[3] https://www.kdnuggets.com/2021/02/why-machine-learning-projects-fail.html or maybe this one https://towardsdatascience.com/why-production-machine-learning-fails-and-how-to-fix-it-b59616184604
What’s next for The Lindahl Letter?
Week 39: Machine learning security
Week 40: Applied machine learning skills
Week 41: Machine learning and the metaverse
Week 42: Time crystals and machine learning
Week 43: Practical machine learning
I’ll try to keep the “What’s next” list forward-looking, with at least five weeks of posts in planning or review. If you enjoyed reading this content, then please take a moment and share it with a friend.