Home / Blog / Find orphan pages in Google Search Console
Guide
Find orphan pages in Google Search Console
An orphan page is a dead end to a search engine. Google Search Console shows you which pages it has found, but finding the ones with no internal links pointing to them takes a manual cross-reference. Here is exactly how to do it.
To find orphan pages in Google Search Console, start in the Indexing section, export your pages, compare the list to your internal links, and you have your orphans. The work is purely mechanical — reading a spreadsheet and cross-referencing URLs by hand. It works. It is also tedious enough that most sites never actually do it.
Here is the exact workflow, step by step, so you understand what you are looking at when you do it, or what a tool like recto is automating when you decide not to.
Why Google Search Console matters for orphan pages
Before diving into the steps, it is worth understanding what GSC shows you and what it does not. Google Search Console is Google's own view of your site: the pages it has found, how often it crawls them, how many times they appear in search results, and whether any of them have errors Google can see. This is valuable data an audit tool cannot replicate, because Google knows whether a page actually appears in search. A page that sits invisible to a crawler might still be discoverable if Google has seen it from an external link, or if you submitted a sitemap.
An orphan page is not the same as an unindexed page. An unindexed page never made it into Google's index at all. An orphan page is indexed, but nobody on your own site links to it. Google found it anyway — perhaps from an external backlink, perhaps from an old cached version — but because there is no internal link pointing to it, Google visits it less often and ranks it lower than it could be.
GSC will not tell you directly that a page is orphaned. It will not have a label that says "no internal links." What it shows is a list of pages Google has indexed, which is the starting point for your own comparison against your site structure.
Step 1: access the Indexing section
Log into Google Search Console and select your property. In the left sidebar, find the Indexing section and click on "Pages." This report shows every page Google has indexed on your site. At the top you will see a summary: how many pages are indexed, how many are not indexable, and how many are excluded.
The main table below shows individual pages, the last time Google crawled them, and how many times they have appeared in search results over the past three months. This last metric is useful: a page with zero impressions is either very new, or it is buried so deeply in search results that nobody is finding it naturally. A page with impressions is close to ranking; if you fix its orphan status, it has a real chance of moving up.
Step 2: export your indexed pages
The export button is at the top of the Pages report. Click it and save the CSV file. The file will contain every indexed page GSC knows about, plus metadata like crawl date and search impressions. Do not filter this export. You want the whole list.
You now have Google's view of your site. The next step is to compare it against what your site actually contains.
Step 3: get your complete page list
You need a list of every page on your site that should be linked. The cleanest source is your XML sitemap, which you can download and extract. If you do not have a sitemap, or if your sitemap is out of date, you can export a list by running a crawler — recto crawls your site, as does Screaming Frog and a dozen other tools. The important part is that you have a definitive list of pages you have published and want to be findable.
Open your sitemap in a text editor and extract the URL list, or export your crawl data as a CSV with all discovered URLs. You now have what your site contains.
Step 4: compare the two lists
The tedious part: you now have two spreadsheets — one from GSC showing indexed pages, and one from your sitemap showing all your pages. Your goal is to find the pages in the GSC file that have zero internal links pointing to them.
The way to do this is to compare the two files. A page that appears in your GSC export but not in your sitemap is a page Google found some other way (usually an external backlink or a cached version) but that does not appear in your site structure. More interesting are pages that appear in both files but have zero impressions and have not been crawled recently. Those are likely orphaned.
If you are comfortable with spreadsheet formulas, you can use a VLOOKUP or INDEX/MATCH to automate the match. If not, you can sort both lists alphabetically and scan for gaps. The work is manual, but it is straightforward.
Step 5: verify with the Links report
Once you have a candidate list of orphans, spot-check them using the Google Search Console Links report. This report shows which external sites link to your pages, and also which pages on your site link to a given URL. If a page has zero "internal links" in the Links report, it is truly orphaned — nothing on your site points to it.
Go to the Links section, find one of your candidate orphan pages in the search box, and check how many internal links point to it. Zero means you have found an orphan page worth fixing. If there are internal links and you missed them, you now have a better understanding of your site structure.
Step 6: use URL Inspection to confirm indexation
For any page you are about to fix, use the URL Inspection tool to confirm it is actually indexed. Paste the URL, and GSC will show you whether Google has crawled it, whether it is indexed, and any errors it found. A page that is crawled and indexed but has zero impressions and no internal links is a perfect candidate for fixing. A page that is not indexed at all needs a different approach — you might need to add it to your sitemap or debug why Google cannot crawl it.
Why the manual method is cumbersome
This workflow is reliable and free. It uses data directly from Google, with no guesswork. The trouble is that it does not scale beyond about fifty or a hundred pages. A site with five hundred indexed pages and a hundred-item sitemap becomes a nightmare to cross-reference by hand. You wind up with spreadsheets, helper columns, and hours of clicking to verify individual pages.
The comparison also does not tell you which orphan pages matter. An orphan page with zero search impressions might not be worth your time. An orphan page with two hundred impressions — getting close to ranking but held back only by its orphaned status — is worth hours of work to fix. The manual method leaves you staring at a list of orphaned URLs with no way to prioritize them.
Where automation saves time
This is the exact problem recto solves. Instead of exporting files and doing spreadsheet surgery, recto crawls your site once, pulls your Google Search Console data, and automatically cross-references the two. It shows you every orphan page ranked by search impressions, so the pages closest to ranking float to the top.
For each orphan, recto finds a relevant page on your site that already has authority, scans that page for a phrase that would work as anchor text, and shows you the proposed link before you approve it. You review, you click approve, and recto publishes the link through the WordPress REST API — then fetches the page again to confirm the link is live. No spreadsheet, no manual cross-reference, no guessing about priority.
The manual GSC method is the foundation of the work. Understanding what you are doing when you use a tool — what data it is comparing, how it is ranking the results, how it knows a page is orphaned — makes you a better judge of whether the tool is doing the job right. And if you want to do it by hand, now you know exactly how.
Next steps
If you are going to fix orphan pages, start with the manual method on a small subset of your site to understand the process. Export your indexed pages from GSC, compare them to your sitemap, and spot-check a few candidates with the Links report. If you find orphans, link to them from a relevant existing page using a phrase that describes the destination. After you publish, use URL Inspection to confirm the link is live.
Once you understand the method, the question becomes scale. A site with thousands of pages needs automation to make the work sustainable. Whether you reach for a desktop crawler, a plugin like Link Whisper, or a dedicated tool like recto, the principle is the same: crawl your site, pull GSC data, find the orphans, and link them from pages that can help. The tool changes the tedium from hours to minutes. The work itself — rescuing forgotten pages and connecting them back to your site — stays the same.
Sources
- Google Search Console Indexing > Pages report — support.google.com
- Google Search Console Links report shows pages that link to your site — support.google.com
- URL Inspection tool in Google Search Console — support.google.com
- Sitemaps and how Google crawls them — developers.google.com