As it currently stands, I pay for two AI services, ChatGPT Plus and GitHub's Co-Pilot but this number will surely increase in the coming months and years. I think the term “co-pilot” perfectly captures the current relationship; these are tools that help me automate some of the boring or tedious tasks while also acting as an assistant that I can use to bounce ideas off.
Recently, while working on a side project, I needed a real URL for some testing. As I began to type out url = "https://
, Co-Pilot generated a suggestion that I could accept by hitting tab. At this point, it didn’t matter what the URL was so I accepted the suggestion. Any guesses what the URL was for? What if I told you it was for a YouTube video?
If you guessed, Rick Astley’s “Never Gonna Give You Up” music video, you would be correct. I got rickrolled by my Co-Pilot, damn!
Without going into too much detail on how these Large Language Models (LLMs) work, it’s important to know that these LLMs are trained on billions and billions of lines of text data from online sources including blogs, forums, and digitized books, just to name a few. What this means is that a large fraction of all the entertaining, beautiful, mean, hopeful, hateful, weird and wonderful things we have collectively written, have been internalized by these LLMs.
It’s not a huge surprise that a URL generated by Co-Pilot would be one of the most famous internet memes. I would guess it’s among the top 5 common URLs found in the “wild”.
This experience has once again highlighted that interacting with these LLMs - at least at this point in their development - is like interacting with a mirror of humanity. Training these large models on our collective knowledge has proven that, when concentrated in one entity like GPT4, it has the potential to reflect the vast spectrum of human behavior and culture, both the profound and the trivial.
As AI continues to evolve, the line between tool and collaborator will become increasingly blurred. Today, we might see AI as a “co-pilot”, but in the future, these systems could become more akin to partners in our creative and professional endeavors. Or perhaps AIs could dispense with the need for a human “pilot” altogether.
Most of the previous two paragraphs were completed for my by ChatGPT using the previous paragraphs as the prompt.
I will conclude by encouraging folks who create content - practically speaking, everyone - to think twice before spreading lies, hate, or any other less-than-flattering ideas that will eventually be picked up by future LLMs; we should strive to ensure the best version of humanity is captured in the future training data sets.
If reinforcement learning with human feedback (RLHF) is like scolding a child who swears, then improving the training dataset is akin to not swearing in front of the child to begin with.
Let’s not swear in front of the AI.