Machine learning security
The folks over at NVIDIA announced a very large new language model this week, “Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, the World’s Largest and Most Powerful Generative Language Model.”[1] At 530 billion parameters, that is a very large generative language model indeed. All of these model releases and all of the code being shared within the machine learning space, on GitHub and via other means, create a situation where both the shared security of the space and the security of each individual project are important.
A lot more articles showed up in Google Scholar for “machine learning security” than I expected to see this week.[2] Some of that content veers off into privacy related items, and some of it covers very specific use cases where machine learning is applied to security problems. Given the islands of content on this one, it is probably best to consider both the machine learning use cases related to security and the very real and growing questions around the security of actual machine learning implementations, code, and shared projects. A lot of the practical open source MLOps projects that I track on GitHub are scanned all over the place, and people put in pull requests and communicate out the potential problems that might exist. Every once in a while you might even get to read about a temporary private fork of one of those MLOps projects where somebody is working very hard to patch some block of code to the point that it passes scanning and other security measures.[3] That is one way to really work something to resolution, but it is an interesting exercise because it is not part of driving the product features, yet it is critical to being able to use the product in enterprise settings.
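To make that dependency scanning idea a little more concrete, here is a minimal sketch of checking a single pinned dependency against the public OSV.dev vulnerability database. This is my own illustration rather than anything pulled from one of those repositories; it assumes the requests package is installed, and the package name and version are placeholders standing in for whatever an MLOps project actually pins.

```python
# Minimal sketch: query the public OSV.dev vulnerability database for one
# pinned dependency. The package name and version below are placeholders,
# not a claim about any particular MLOps project.
import requests

payload = {
    "package": {"name": "mlflow", "ecosystem": "PyPI"},
    "version": "1.20.0",
}

response = requests.post("https://api.osv.dev/v1/query", json=payload, timeout=10)
response.raise_for_status()

# OSV returns a "vulns" list only when known vulnerabilities match the pin.
vulns = response.json().get("vulns", [])
for vuln in vulns:
    print(vuln.get("id"), "-", vuln.get("summary", "no summary provided"))
print(f"{len(vulns)} known vulnerabilities found for this pin")
```

Running something like that across a full requirements file is roughly what the automated scanners do on every pull request, which is part of why those security focused pull requests keep showing up in the repositories I watch.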
Detecting statistical anomalies is something that machine learning implementations are fully capable of doing. One of the use cases that people seem to like relates to cyber security intrusion detection.[4] A lot of security work is about spotting things within the normal pattern of usage that flag as abnormal. That is one of the reasons why monitoring and traffic analysis are such a big part of security, and machine learning can play a vital part in that type of work. To that end, I spent some time reading an article from the Cisco team about security applications that utilize machine learning in practice.[5] It was a decent read about how machine learning can augment security use cases, but I was trying to dig more into how security is handled within the actual machine learning instances. Software in the AIOps and MLOps spaces exists and is changing rapidly; every day you can see the updates on GitHub. The security of those applications is where my attention was during all of my searching. You can see the security related pushes and pull requests within the GitHub repositories of the MLOps software I watch on a regular basis. My plan is to circle back with an analysis of those patterns to see if I can isolate the security related items.
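For anybody who wants to see the anomaly detection idea in code, the toy sketch below flags outliers in made up network traffic features. It is not from the Cisco article or the cited intrusion detection paper; it assumes scikit-learn and NumPy are installed, and the numbers are invented stand-ins for per connection statistics like packet counts and bytes transferred.

```python
# Toy sketch of anomaly-based intrusion detection with an Isolation Forest.
# All numbers are invented stand-ins for per-connection traffic features.
import numpy as np
from sklearn.ensemble import IsolationForest

rng = np.random.default_rng(42)

# Simulated "normal" traffic: two features per connection (packets, bytes).
normal_traffic = rng.normal(loc=[100, 5000], scale=[10, 500], size=(1000, 2))

# A handful of connections that sit far outside the usual pattern.
suspicious_traffic = rng.normal(loc=[400, 50000], scale=[20, 2000], size=(5, 2))

# Fit on traffic assumed to be mostly normal, then score new connections.
detector = IsolationForest(contamination=0.01, random_state=42)
detector.fit(normal_traffic)

# predict() returns 1 for inliers and -1 for flagged anomalies.
flags = detector.predict(np.vstack([normal_traffic[:5], suspicious_traffic]))
print(flags)  # the last five entries should come back as -1
```

Real intrusion detection obviously involves far more feature engineering and far messier data, but the underlying pattern of learning what normal looks like and flagging departures from it is the same.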
Links and thoughts:
Linus and Luke were back in the studio again this week and I listened to them chat about things during the course of writing this missive, “Best Buy Scalping PS5s... for SHAME - WAN Show October 15, 2021”
You can check out the Microsoft Developer AI show this week, “AI Show Live - Episode 35 - Building computer vision models using AutoML for Images”
Here is the keynote (a high production value informational video) from Google Cloud Next if you wanted to catch up on that one, “Google Cloud Next Developer Keynote”
Top 6 Tweets of the week:
Footnotes:
[2] https://scholar.google.com/scholar?q=machine+learning+security&hl=en&as_sdt=0&as_vis=1&oi=scholart
[4] https://ieeexplore.ieee.org/abstract/document/7307098
[5] https://www.cisco.com/c/en/us/products/security/machine-learning-security.html#~how-ml-works
What’s next for The Lindahl Letter?
Week 40: Applied machine learning skills
Week 41: Machine learning and the metaverse
Week 42: Time crystals and machine learning
Week 43: Practical machine learning
Week 44: Machine learning salaries
I’ll try to keep the “What’s next” list forward-looking with at least five weeks of posts in planning or review. If you enjoyed reading this content, then please take a moment and share it with a friend.