TXT Files
The TXT Files section allows you to view text-based configuration files located in the root directory of the website.
These files typically include configurations such as:
- Search engine crawling rules
- Security policies
- Bot access rules
WebPixie automatically detects these files and allows you to view their contents directly.
Robots.txt
The robots.txt file contains rules that determine how search engine bots should crawl your website.
With this file, site owners can control:
- which pages can be crawled
- which pages cannot be crawled
Common Directives in robots.txt
User-agent
Specifies which search engine bot the rules apply to.
Example:
User-agent: *
This means the rules apply to all bots.
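To illustrate how User-agent groups decide which rules a bot follows, here is a minimal sketch using Python's standard `urllib.robotparser` module. The bot names, paths, and `example.com` URLs are assumptions chosen for illustration and do not come from any real site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: one group for a named bot, one for all other bots.
SAMPLE = """\
User-agent: Googlebot
Disallow: /private

User-agent: *
Disallow: /
"""


def parse_rules(robots_txt: str) -> RobotFileParser:
    """Build a parser from robots.txt text without any network access."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


rules = parse_rules(SAMPLE)

# The named group applies to Googlebot; every other bot falls back to "*".
googlebot_ok = rules.can_fetch("Googlebot", "https://example.com/blog")
otherbot_ok = rules.can_fetch("OtherBot", "https://example.com/blog")
print(googlebot_ok, otherbot_ok)
```

In this sketch the named bot may crawl `/blog` because only `/private` is disallowed in its group, while any other bot is covered by the `*` group, which disallows the whole site.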
Disallow
Specifies paths that search engine bots should not crawl. Each value is matched as a path prefix, so a rule also covers every URL that begins with it.
Example:
Disallow: /admin
Disallow: /login
These rules prevent the specified pages from being crawled.
You can use multiple Disallow directives to block different sections of your site from search engines.
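The effect of the two Disallow rules above can be checked with a short sketch built on Python's standard `urllib.robotparser` module. The helper name `is_crawlable`, the bot name `ExampleBot`, and the test paths are hypothetical and used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# The rules from the example above, applied to all bots.
EXAMPLE_RULES = """\
User-agent: *
Disallow: /admin
Disallow: /login
"""


def is_crawlable(robots_txt: str, path: str, user_agent: str = "ExampleBot") -> bool:
    """Return True if the given bot may crawl the path under these rules."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, path)


# Disallow values match by prefix, so /admin/users is blocked along with /admin.
for path in ["/admin", "/admin/users", "/login", "/blog"]:
    print(path, is_crawlable(EXAMPLE_RULES, path))
```

Because matching is prefix-based, a short rule such as `/admin` can block more URLs than intended, which is worth keeping in mind when adding several Disallow lines.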
Using WebPixie with robots.txt
WebPixie automatically detects the robots.txt file and displays its contents in the analysis screen.
This allows users to:
- quickly review crawling rules
- spot misconfigurations
- identify critical SEO issues
Reviewing your robots.txt regularly helps ensure your site is crawled and indexed as intended, and helps you avoid accidentally blocking important pages.
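As a rough illustration of what such a review can look for, the sketch below lists important pages that a robots.txt file would block. It is not how WebPixie performs its analysis; the function name `find_blocked_paths`, the sample rules, and the list of important paths are all assumptions made for this example.

```python
from urllib.robotparser import RobotFileParser


def find_blocked_paths(robots_txt, important_paths, user_agent="ExampleBot"):
    """Return the important paths that the rules would stop this bot from crawling."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [p for p in important_paths if not parser.can_fetch(user_agent, p)]


# Hypothetical misconfiguration: /products was disallowed by mistake.
SAMPLE_RULES = """\
User-agent: *
Disallow: /admin
Disallow: /products
"""

# Hypothetical pages the site owner wants search engines to reach.
IMPORTANT = ["/", "/products/shoes", "/about"]

blocked = find_blocked_paths(SAMPLE_RULES, IMPORTANT)
print(blocked)
```

An empty result would suggest that none of the listed pages are blocked for that bot; any path in the result is a candidate for a closer look at the corresponding Disallow rule.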