Karpathy's Obsidian RAG + Claude Code = CHEAT CODE

Plik: 2026-05-06_Karpathy's_Obsidian_RAG_+_Claude_Code_=_CHEAT_CODE.md · Pobierz transkrypcję (.md)
---
title: "Karpathy's Obsidian RAG + Claude Code = CHEAT CODE"
video_id: OSZdFnQmgRw
source: https://www.youtube.com/watch?v=OSZdFnQmgRw
date: 2026-05-06
type: transcript
---

# Karpathy's Obsidian RAG + Claude Code = CHEAT CODE

[Źródło](https://www.youtube.com/watch?v=OSZdFnQmgRw)

## Transkrypcja

Andre Carpathy just gave us the keys to
his personal Obsidian Rag system. And I
put Rag in air quotes because this
Obsidian power knowledge base has no
vector database, no embeddings, and no
complicated retrieval process. Yet, it
solves the exact same problem that these
more complicated rag structures claim to
do, which is allow our large language
model to handle large amounts of
documents and answer questions and
gather accurate information about them.
And the best part about this Obsidian
powered system is that it is very
lightweight. It's essentially free and
it is the perfect middle ground for a
solo operator or a small team. So today
I'm going to show you how Carpathy's
Obsidian knowledge system works, how to
set it up yourself, and how it differs
between traditional rag systems so you
know if this is the right option for
you. So the process by which we are
going to create this obsidianpowered
knowledge system was laid out yesterday
in a pretty comprehensive Twitter post
by Andre Karpathy. Now, the big takeaway
from this post is that we are able to
create large language model knowledge
bases that essentially act in the same
way as something like light rag or rag
anything or any other graph rag system
with obsidian. And we're able to do so
in a rather simple manner by just having
a clever structure to our file system
and how we actually ingest data. And the
end result is that I am able to ingest a
pretty significant amount of data and
documents into my Obsidian vault and use
cloud code to ask questions about it to
figure out connections between different
things. Aka the exact same thing you
would do with a traditional rag system,
but with none of the overhead and a way
simpler setup. And as Andre lays out,
the setup looks something like this.
First, we have data ingestion. We are
bringing in articles. We're bringing in
papers. We're bringing in repos from the
internet or from wherever and we're
bringing it into a raw directory inside
of our Obsidian vault. This is
essentially the staging area before it
gets turned into a wiki. We as the human
being in this interaction are able to
see all of this happening via Obsidian.
Obsidian for all intents and purposes is
our front end. Here is where I can see
where all the documents are laid out.
Here's where I can read all the wiks.
So, it isn't sort of abstracted away in
a black box like it isn't a rag system.
It's kind of hard even in a graph rag
setup like light rag to actually go
inside of here and really see
everything. I mean I can but as cool as
this looks this isn't you know very
efficient and from there you just do a
Q&A via something like cloud code and
like Andre laid out here he expected
that he would have to reach out for
something like rag but the large
language model has been pretty good
about automaintaining index files and
brief summary of all the documents it
reads and this is something we are going
to be able to do too with a pretty
simple cloudmd file which I will be
giving you and you will be able to find
that cla md as well as a written guide
that comes with a bunch of prompts
inside head of my free Chase AI
community. There will be a link to that
in the description of this video. And
speaking of Chase AI, and you knew this
was coming, quick plug for my Cloud Code
Masterass. Just released this a couple
weeks ago, and it is the number one
place to go from zero to AI dev,
especially if you do not come from a
technical background. You can find a
link to this in the pinned comment. So,
make sure to check this out if you're
serious about learning this tool. Now,
before we jump into the specifics of how
to set up this Obsidian system for
yourself, let's go over the actual file
structure because this is important to
understand how data is coming into our
vault and then getting turned into
wikis. So, the Obsidian vault is where
everything lives. As you'll see, if
you've never used it before, when you
download Obsidian, you are going to
designate a specific folder as the
vault. In my case, it is quite literally
called the vault. That's where
everything in Obsidian lives. As a
subfolder of the vault, we are going to
have the raw folder. The raw folder is
where all of our research gets dumped.
Anything we want to manually include in
these wikis gets put. This is
essentially the staging folder. So, this
is where all the raw data is going to be
held. This can be markdown files. This
can be PDFs. And I'm going to show you
how to use the Obsidian Clipper to
essentially turn any web page into a
markdown file that gets sent to the raw
folder automatically. We will have
another subfolder that is the wiki
folder. So what the large language model
is going to do, what cloud code will do
for us is on demand or you could have it
even be a skill or have it be automated
is we are going to point it at the raw
folder and say hey I want you to create
a wiki about whatever subject you've
been gathering information about. From
there it will then create a wiki about
that. So you can see we have three
different wiks here. One for AI agents,
one for rag systems and one for content
creation. Now in in between the wiki
folder and these sub wiki folders is the
master index markdown. This is
essentially just a list of all of the
different wiks that have been created
because the idea is when you this is you
when you talk to claude code all right
that's cloud code over there and say hey
I want to learn more about AI agents can
you ask you know I want to ask questions
about my wiki well what is it going to
do? Well, it's going to go to the vault
because you're probably already in
there. It's then going to go to the wiki
folder. It's going to go to the master
index folder and say, "Hey, what wikies
have we created?" Oh, he wants to know
about rag systems. Okay, goes down to
rag and the wiki folders themselves have
index files which break down all the
additional content. So, what Obsidian
gives us and what this file structure
gives us is a very clear path to find
information even if we have a ton of it
floating around. And this helps claude
code because it's not going to have a
ton of issues finding the data. We're
not going to run a million tool calls to
see what's in our file structure, but it
also helps you because it's very clear
where to go. For example, over here on
the left is my Obsidian folder. I'm in
the Obsidian UI, and we'll go through
the download here in a second. But if I
want to see a wiki, what do I do? I just
go to wiki. I have a master index which
lays down everything in there. Right
now, it's just three things, but if
there were 3,000, it still wouldn't be
too difficult. And then from there, you
know, I can click on it. It takes me to
the index of that specific wiki. And
then I can look at different stuff
inside of there. It's that simple. And
it's that simple for AI, too, which is
why we're able to use essentially just a
markdown file structure to somewhat
mimic a rag system. So, while that
theory is cool, now let's go into how to
actually set this up for yourself. First
and foremost, you're going to need to
download Obsidian. You're just going to
head to obsidian.md,
hit download now, go through the wizard.
It's completely free and you're going to
designate some folder as the vault. Just
create one, call it the vault. Makes it
easy for me and that'll probably work
for you. After we create the vault, we
now need to set up this file structure
inside of it. The easiest way to do that
is with Claude Code. Simply open up
Claude Code in the vault. So that's the
directory I'm in. And you're going to
give it a prompt telling it to create
this file structure. Now, luckily for
you, I already created the prompt. So
you can just copy this thing and paste
it in the cloud code. Now, if you're
like me and you've already been using
Obsidian for a bit, you probably have a
bunch of folders already in there. So,
maybe you don't want to call it RAW.
Maybe you want to call it something
else. The whole point of it is you just
need to designate some folder is, like I
said, sort of the holding area or the
staging area for where all this
information is going to get dumped until
it gets turned into a wiki. So, adjust
as needed. Now, the next thing we want
to do is create a cloud.md file.
Personal assistant type projects, things
like this that are very markdown heavy,
claw.mds are perfect for. And this
claw.md file is breaking down the
knowledgebased rules as well as how to
essentially traverse it. So again that
we aren't wasting tokens when we ask
questions. Again I have this entire
claw.md template prompt you can use this
claw.md file is also telling claude how
to structure these markdown files. So
it's very easy to traverse files with
this wiki links format. Now let's talk
about how we can bring things into this
raw folder. How we can get data into our
system in the first place. Well, a super
easy way to do this is with the Obsidian
Web Clipper. So, I will put a link to
this in the school or you can go to
obsidian.mmd/clipper.
And this is just a Chrome extension
which makes it super easy to turn a web
page into data into a markdown file.
Now, the one issue with this web clipper
is it's going to struggle with images.
It's just not even going to bring them
in. It'll have them as a link. But I
want to be able to see the images from
these documents I ingest inside of
Obsidian. So, what do we do? Well, we
are going to use an Obsidian community
skill or Obsidian community plugin to
help with this. So, one of the cool
things about Obsidian is the community
plugins. There's thousands of them. So,
if you're inside of Obsidian, I'm inside
the desktop app right now. If I come
down here and I hit this little gear,
I'm going to go to community plugins.
I'm going to go to browse. And then
you're going to search for local images
plus. You're going to download it,
install it, and turn it on. Make sure
it's enabled. You can confirm it's
enabled by heading to your community
plugins tab and seeing this little tab
turned on. Now, if we use the Obsidian
Web Clipper, and I can see that over
here as an extension, you can see what
happens. It immediately pulls
everything. And if I hit add to
Obsidian, I can see this entire article,
including the images. Now, there is one
thing we need to set up inside of the
web clipper, and that's making sure it
actually pulls it into the raw folder
automatically. I don't want to have to
manually do that. You're just going to
head to the options on your web clipper.
I just rightclicked it. And then over
here on the left where it says default,
I created my own new template, but you
can stick on the default if you want.
Where it says location
and note location right here, you're
want you're going want to change that
from clippings to raw. And that will
make sure when you use the web clipper,
it automatically goes into the raw
folder. So now with the Obsidian Web
Clipper extension and the images
community plugin, we can now turn any
web page on the internet into a markdown
file that will be used for our wiki. But
that is just one data funnel. That's a
manual one. We can have Claude Code do a
bunch of heavy lifting, too. So let's
say I was trying to create a wiki about
Claude Code skills. So I told Claude
Code, let's create a wiki about claude
code skills. I already included some
info in the raw folder, what we pulled
in via the web clipper. go conduct your
own research and bring in the relevant
raw MD files to generate that wiki. So
what is it going to do? It's going to go
on the internet use its standard web
search and it's going to create its own
wiki about claude code skills. So what
you see is that this raw folder this
whole raw pipeline that's more for you.
That's for when you mainly want to put
in some information. Now you can have
cloud code do that as well but cloud
code is also smart enough to essentially
take the research figure out what's
relevant itself and just create the wiki
directly. This raw folder is really for
you the human being to have some level
of organization. And here's what cloud
code came back with. So it created the
cloud code skills wiki. We see here in
the master index that it's referenced
here. If I click on it, this then brings
us to the index of claude code skills.
And right now it has four articles. So
here's the skills overview article. You
can see it links to websites and it also
links to different articles within our
obsidian vault. So if I click on skill
ecosystem, here's more stuff. If I click
on top skills, right? So on and so
forth. There's a very clear pathway from
one article to another and how these
things relate. Which means when you ask
cloud code questions about these
articles in these subjects, it's easy
and cheap for it to answer questions
about them. Which then brings us to the
obvious question. Do we need rag at all?
You know, we look at something like this
light rag setup. You watch my last few
videos with light rag and rag anything.
and seeing how simple the setup with
Obsidian, you're probably like, "Well,
why would I ever even bother with these
more complicated setups at all?" And the
truth is, if you're a solo dev, a solo
operator, or a small team that isn't
dealing with thousands of documents, the
answer probably is Obsidian makes more
sense for you. It's lightweight, and you
really don't need Rag. These large
language models, these harnesses like
Cloud Code are good enough for your use
case. And we can sit here and get in the
weeds about the differences between the
obsidian rag and true rag. But the truth
is the big thing is scale, right? Are we
trying to scale to millions of documents
or are we not? Because at a certain
scale, it's going to be cheaper and
faster to use a proper rag system no
matter how good cloud code is at
navigating this MD file document network
you've created. But this isn't a
question you necessarily need to have
the exact answer to right away. Why
wouldn't you just start with something
like obsidian? And if it's clear your
scale goes well beyond the bounds of
what this thing can handle, then just
move into rag. I think people get really
caught up in like answering this
question when it's like just try it out.
Just experiment. It's not costing you
anything to use some sort of rag system,
rag system like obsidian. And if it
doesn't work, it doesn't work. Fine.
Then go use light rag instead. People
want to sit here as they inevitably will
in the comments and like argue this back
and forth. Just try it. And I think the
answer will be pretty clear at a certain
point when you need to move to a true
rag system. But the nice thing with this
is is again most people don't need a
real rag system. They just don't, right?
Even if they're in a small business team
situation. So having a proper, you know,
orchestrated system like the subsidian
knowledge base, I think is a huge boon
to the majority of people. So I hope
this breakdown was useful to you.
Definitely check out Andre's post about
this. He goes into a fair amount of
detail. Make sure to check out the free
Chase AI school. There's a link to that
in the description that has all the
prompts and a written breakdown of how
to actually do this if you got confused
at any part. And as always, take a look
at Chase AI Plus if you want to get your
hands on that master class. Besides
that, let me know what you thought and
I'll see you