The robots.txt protocol does not block site robots
However, care must be taken to ensure that the Robots.txt protocol does not block site robots from other areas of the site. This has a huge impact on your search engine rankings because crawlers rely on robots to count keywords, examine meta tags, titles and crosswords, and even record hyperlinks.
A misplaced hyphen or hyphen can have disastrous consequences. For example, robots.txt patterns are matched via simple substring comparisons. Therefore, you must ensure that patterns that match directories are followed by a trailing “/” character. Otherwise, all files whose names begin with this substring will match, not just the files in the expected directory.
To avoid these problems, you should consider running your website through a search engine spider simulator, also known as a search engine crawler simulator. These simulators, which can be purchased or downloaded on the Internet, use the same processes and strategies as various search engines and allow you to see how your website is read. You’ll learn which pages are skipped, which links are ignored, and which errors occur. Since the simulators also replicate how robots follow your hyperlinks, you can see whether the robot.txt protocol is affecting the search engine’s ability to read all the required pages.
It’s also important to review your robot.txt files so you can identify and fix problems before submitting them to real search engines.