Prompt engineering and machine learning
During the very long car ride back to Kansas City from Denver for Thanksgiving this year, the whole family got to listen to an audiobook called “The Age of AI: And Our Human Future” by Daniel Huttenlocher, Eric Schmidt, and Henry Kissinger (2021). Outside of an oddly lengthy aside on the history of philosophical thought going back to Plato that snuck into one chapter (you will know it when you get to it), the book was a pretty good read and provided solid examples of where artificial intelligence currently stands. The authors were quick to point out the limitations that existed at the time the book was written.
One of the realities of working within the space of modern machine learning is the very real necessity of prompt engineering and of having people who understand it. Is this just a very fancy method of leading the witness? That could very well be the case. The cornucopia of large language models being released seemingly monthly now are all pretty much interacted with through prompt engineering.[1] The other day I spent some time looking at the Megatron-Turing large language model and trying to consider how much bigger a model could get before the language it was modeling would be exhausted.[2] To me it was a modernity-adjacent version of Borges’ map fable: to exhaustively map everything would seemingly be an effort to create a mirror of modernity. At 530 billion parameters it is an exceedingly large generative language model. These models are super interesting, and you might recall that paper from a ton of Stanford University professors, “On the Opportunities and Risks of Foundation Models.”[3] It took this paragraph a bit to get to the point of this missive: these types of models are here to stay, and they require a special skill set called prompt engineering.
To engage in prompt engineering, people are going to face challenges getting the expected outputs.[4] You work with the model by giving it prompts, or by writing a predefined prompt package, to coax the outputs you want from the model. During the course of building my own bot using the GPT-2 model, I learned really quickly that even with a corpus of millions of words of my own writing it required the right type of prompt to get output from the model that matched my writing style.[5] That is what makes prompt engineering so interesting in terms of these supremely large language models. People spend so much effort compiling, training, and packaging these generative models (foundation models) that needing a specialized skill set to work with them is an interesting situation to have occurred.
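To make that concrete, here is a minimal sketch of prompt-driven generation using the publicly available GPT-2 checkpoint through the Hugging Face transformers library. The prompt text and sampling settings below are illustrative assumptions, not the ones used for my bot, but they show where the “engineering” happens: the seed text and decoding knobs are what you tune to steer the output.

```python
# A minimal sketch of prompting a GPT-2 style model with Hugging Face transformers.
# The prompt and sampling parameters are placeholders for illustration.
from transformers import pipeline

# Load a small, publicly available GPT-2 checkpoint for text generation.
generator = pipeline("text-generation", model="gpt2")

# The prompt engineering part: the wording of this seed text steers what the
# model produces. Small changes here can shift the style and content a lot.
prompt = "Machine learning newsletters usually open with"

outputs = generator(
    prompt,
    max_length=60,           # cap on the total length of prompt plus completion
    num_return_sequences=3,  # sample a few candidates to compare
    do_sample=True,          # sample instead of greedy decoding for variety
    temperature=0.8,         # lower values make the output more conservative
)

for i, out in enumerate(outputs):
    print(f"--- candidate {i + 1} ---")
    print(out["generated_text"])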
Links and thoughts:
“AI Show Live - Episode 41 - Best of AI Show Holiday Edition!”
“PC Gaming is Officially the BEST! - WAN Show November 26, 2021”
Top 6 Tweets of the week:
Footnotes:
[1] https://blog.andrewcantino.com/blog/2021/04/21/prompt-engineering-tips-and-tricks/
[3] https://fsi.stanford.edu/publication/opportunities-and-risks-foundation-models
[4] http://ai.stanford.edu/blog/in-context-learning/
What’s next for The Lindahl Letter?
Week 46: Machine learning and deep learning
Week 47: Anomaly detection and machine learning
Week 48: Machine learning applications revisited
Week 49: Machine learning assets
Week 50: Is machine learning the new oil?
Week 51: What is scientific machine learning?
Week 52: That one with a machine learning post
I’ll try to keep the what’s next list forward-looking with at least five weeks of posts in planning or review. If you enjoyed reading this content, then please take a moment and share it with a friend.