</>
Vizly

What is AI? From Rules to Learning

April 4, 20268 min
AIBasicsHistory

AI, ML, Deep Learning, GenAI — what do these words actually mean? A plain-English starting point with zero jargon.

1What is AI? From Rules to Learning
2How Machines Learn
3Neural Networks from Scratch

So... what is AI, really?

You hear "AI" everywhere now. Your phone has it. Your email has it. Your toaster probably has it (okay, maybe not yet).

But what does it actually mean?

At its core, AI is just this: making computers do stuff that normally needs a brain. Recognizing a face in a photo. Translating a sentence. Beating you at chess. Writing code. That's all AI.

Here's the thing though — AI isn't one technology. It's more like a family.

Think of it like Russian dolls. Each one fits inside the bigger one:

  • AI is the big picture — any computer that seems "smart"
  • Machine Learning is a specific way to make AI — by showing it examples instead of writing rules
  • Deep Learning is a specific way to do ML — using layers of artificial "neurons"
  • Generative AI is a specific use of Deep Learning — creating brand new content

So when someone says "ChatGPT is AI" — yes, technically. But it's specifically Generative AI, built on Deep Learning, which is a type of Machine Learning. Knowing which layer you're talking about actually matters.


The old way: writing rules by hand

Imagine you're building a spam filter in 1995.

You'd sit down and write rules like this:

IF email contains "free money" AND sender is unknown
THEN → spam

Makes sense, right? You're the expert. You know what spam looks like. So you write hundreds of these rules, one by one.

This actually worked! Banks used systems like this. Hospitals too. They were called Expert Systems — because human experts wrote all the rules.

But there's a problem.

What happens when a spammer writes "Fr33 M0n3y"? Your rule doesn't catch it. So you add another rule. Then they change tactics again. And again. You're playing whack-a-mole forever.

The real world is messy. You can't write a rule for every possible situation. And that's where the second approach comes in.


The new way: let the machine figure it out

Instead of you writing the rules, what if the computer learned them on its own?

That's Machine Learning. And the idea is surprisingly simple.

You take 10,000 emails. Half are spam, half are normal. You label them. Then you feed them to an algorithm and say: "figure out the difference."

The algorithm looks at thousands of tiny patterns — word choices, formatting, sender behavior, time of day — stuff you'd never think to check. And it builds its own internal "rulebook."

Now when a new email comes in, the model checks it against everything it learned. Even "Fr33 M0n3y" gets caught, because the model learned deeper patterns than just exact word matching.

This is the big idea

Old AI: humans write the rules. New AI: humans provide examples, the machine writes its own rules. That single shift changed everything.


Three flavors of Machine Learning

1. Supervised Learning — learning with an answer key

This is the most common type. You give the machine a bunch of examples with the correct answers attached.

Think of it like studying for an exam with a textbook that has solutions in the back.

You show the machine...With the answer...
A photo of a cat"This is a cat"
A photo of a dog"This is a dog"
An email"Spam" or "Not spam"
House details (size, location)"Worth $450,000"

After seeing thousands of these, it learns the pattern. Show it a new photo, it can tell you: cat or dog. Show it a new house, it can guess the price.

Where you see this: Gmail spam filter, Netflix recommendations, face unlock on your phone.

2. Unsupervised Learning — finding patterns without answers

Sometimes you don't have the answers. You just have a pile of data and you want the machine to find something interesting in it.

Imagine dumping 100,000 customer profiles on a table and asking: "Are there any natural groups here?" The algorithm might discover that your customers fall into 5 distinct types based on their behavior — even though nobody told it those groups existed.

Where you see this: Customer segmentation, detecting unusual credit card activity, YouTube's "related videos."

3. Reinforcement Learning — learning by doing

No textbook, no examples. The machine just tries stuff and sees what happens.

It's exactly how you'd train a dog. You don't explain the rules of "fetch." The dog tries random things. When it brings the ball back, it gets a treat. Over time, it figures out what works.

Where you see this: AlphaGo (the AI that beat the world Go champion), self-driving cars, how ChatGPT gets fine-tuned to give better answers.

Quick cheat sheet

Supervised = you give answers, it learns the pattern. Unsupervised = no answers, it finds hidden groups. Reinforcement = it tries stuff and learns from rewards. Most AI you use daily is supervised.


Why now? AI has been around for decades.

This is the question people forget to ask. AI research started in the 1950s. Neural networks were invented in the '50s too. So why did everything explode in the last few years?

Three things happened at the same time:

1. We started drowning in data. The internet gave us more data than we ever imagined. Every photo, every search, every message — it's all training material. We went from "not enough data" to "more data than we know what to do with."

2. Graphics cards got really good. GPUs — originally made so your video games look pretty — turned out to be perfect for training AI. They can crunch thousands of math problems at the same time. Without GPUs, training modern AI would take years instead of days.

3. Someone figured out a better architecture. In 2017, a team at Google published a paper called "Attention Is All You Need." It introduced the Transformer — a new way to build neural networks that's way better at understanding language. That's the "T" in GPT. This one paper changed everything.

The recipe

Tons of data + fast hardware + a clever new design = the AI boom we're living through right now. Remove any one of these three and it wouldn't have happened.


A quick timeline (the highlights)

YearWhat happened
1950Turing asks "Can machines think?"
1957First neural network built (the Perceptron)
1997IBM's Deep Blue beats Kasparov at chess — but it was all rules, no learning
2012Deep Learning wins an image recognition contest by a huge margin — people take notice
2016AlphaGo beats the world Go champion — this was supposed to be decades away
2017"Attention Is All You Need" paper — Transformers are born
2022ChatGPT launches — 100 million users in two months
2024Reasoning models, AI agents, video generation
2025AI coding agents, real-time collaboration with AI
2026Agentic workflows go mainstream, AI reasons through complex tasks

The field had two "winters" (1970s and late '80s) where progress stalled and funding dried up. Each time, people said AI was overhyped. Then it came roaring back.


What's next?

Now you've got the big picture. The words AI, ML, Deep Learning, GenAI — they're not interchangeable buzzwords. They mean specific things, and you now know what.

Next up: how machines actually learn from data. We'll take the simplest possible ML model, watch it train step by step, and build some gut feeling for what "learning" really means when there's no brain involved.

No math. Just pictures and plain English.

Edit this page on GitHub