Search Services
A few quick links to the information available on this site are listed below. Use the request options in the left navigation section to request changes.

Google Search Results
If the search results for your search terms are satisfactory then you need do nothing more. If you would like to improve the results please follow the steps below and look at the Google Best Practices page. Please remember that the measure of a good results set is not the number of pages that are returned. Rather the measure of a good results set is the appropriateness and relevancy of the links in the few pages in the set.

Google Index
Frequent visits by the Google indexer to your websites will keep the Google state-wide index and all the associated subcollections up to date. To check the pages for your site in the Google Index follow these steps:
You can also enter site:[domain] as the search criteria on the main search screen. For example, site:das.ohio.gov would return all the pages on the das.ohio.gov website. Where multiple domain names apply to the same set of pages, Google has been configured to display just one lowercase domain. This is a manual process that may need to be updated from time to time. Any errors should be reported using the Request link to the left. Does Google contain the correct web pages? If not, request additional URLs as starting points.

Page Titles
Review the page titles to ensure they are descriptive and informative. Having one of the search terms in the title moves a page up in the rankings of a results set. The title is what shows up when a page is bookmarked, and it is what search engines display. Title tags are part of the HTML content of a webpage and are the responsibility of the agency.

Important Search Terms and Results Pages
Google gives us the opportunity to fine-tune the search results so that each agency can specify the terms that best represent its service set and identify the primary page for each term. Identify your Top 10 service terms and acronyms and check the Google results. Results will be ranked using Google's internal algorithms. Google administrators can customize KeyMatch terms to influence the results rankings. However, KeyMatch terms apply to the entire index as well as to any collections that contain the pages. Only 3 results can be returned per KeyMatch. For more details, please see the Using KeyMatch to Control Results page.

Pages that Should Be EXCLUDED from the Index
With Google there are 4 methods available to exclude webpages from the state-wide index. Two are controlled by the Search Administrators and two by the Website Owners. Website Owners can use Robots.txt files to control indexing at the directory level and the Robots Meta Tag to control indexing at the page level.

Robots.txt Files
Robots.txt files are simple text files that control access to a directory and its associated subdirectories. Many search engines, including Google, honor and follow the rules set up in the Robots.txt file. To keep all robots and spiders out of a directory, place a file named robots.txt in the root of the target directory. The file should contain the following code:

To apply to all robots and spiders, use...
    User-agent: *
To disallow indexing of all pages, use...
    Disallow: /
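
As a rough way to confirm what the two-line file above does, the following Python sketch feeds it to the standard library's urllib.robotparser and asks whether a crawler may fetch a page. The host www.example.com and the helper name is_allowed are placeholders for illustration only, and the state-wide crawler may interpret rules somewhat differently from this parser.

```python
from urllib.robotparser import RobotFileParser

# The rule from the text: applies to every crawler, blocks every path.
DISALLOW_ALL = """\
User-agent: *
Disallow: /
"""

# For contrast: an empty Disallow value blocks nothing.
ALLOW_ALL = """\
User-agent: *
Disallow:
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# www.example.com is a placeholder host, not a real agency site.
page = "http://www.example.com/index.html"
blocked_result = is_allowed(DISALLOW_ALL, "OhioSearchIndexer", page)
open_result = is_allowed(ALLOW_ALL, "OhioSearchIndexer", page)
```

Running a check like this before publishing a robots.txt file is a cheap way to catch a stray slash that would hide an entire site from the index.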
User-agent: * identifies the spiders and robots to which the rules that follow apply. The (*) is a wildcard and means any spiders and robots. A Disallow: line without (/) tells the robots and spiders they can index the entire site. If the Robots.txt file does not exist in a directory then robots and spiders crawl all links and pages.

To keep certain robots and spiders out of a directory or folder, place a file named robots.txt in the root of the target directory. You can also disallow files.

To disallow indexing a particular folder or directory named agencies, use...
    User-agent: OhioSearchIndexer
    Disallow: /agencies/
Note: the "User-agent: OhioSearchIndexer" applies the robots.txt file to the Ohio Google Crawler
To disallow indexing the file named news.html in the folder or directory named agencies, use...
    User-agent: OhioSearchIndexer
    Disallow: /agencies/news.html
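
The crawler-specific rules can be checked in the same spirit. The sketch below is only an illustration: the Disallow paths are assumed from the folder and file names given in the text, www.example.com is a placeholder host, and can_crawl and SomeOtherBot are hypothetical names. It uses Python's standard urllib.robotparser, which may not match the Ohio crawler's behavior in every detail.

```python
from urllib.robotparser import RobotFileParser

# Assumed rule: keep OhioSearchIndexer out of the whole agencies folder.
FOLDER_RULE = """\
User-agent: OhioSearchIndexer
Disallow: /agencies/
"""

# Assumed rule: keep OhioSearchIndexer away from one file only.
FILE_RULE = """\
User-agent: OhioSearchIndexer
Disallow: /agencies/news.html
"""

BASE = "http://www.example.com"  # placeholder host for illustration


def can_crawl(robots_txt: str, user_agent: str, path: str) -> bool:
    """Return True if user_agent may fetch BASE + path under robots_txt."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, BASE + path)


# Collect a small report: (rule name, crawler, path, allowed?)
report = [
    ("folder", "OhioSearchIndexer", "/agencies/list.html",
     can_crawl(FOLDER_RULE, "OhioSearchIndexer", "/agencies/list.html")),
    ("folder", "SomeOtherBot", "/agencies/list.html",
     can_crawl(FOLDER_RULE, "SomeOtherBot", "/agencies/list.html")),
    ("file", "OhioSearchIndexer", "/agencies/news.html",
     can_crawl(FILE_RULE, "OhioSearchIndexer", "/agencies/news.html")),
    ("file", "OhioSearchIndexer", "/agencies/other.html",
     can_crawl(FILE_RULE, "OhioSearchIndexer", "/agencies/other.html")),
]
```

Because the rules name only OhioSearchIndexer, this parser treats any crawler not listed in the file as unrestricted, which is worth keeping in mind if the goal is to hide a folder from every search engine.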
To allow the state-wide index to crawl and index a directory, add this text to the top of the robots.txt file...
    User-agent: OhioSearchIndexer
    Disallow:

Robots Meta Tags
Robots Meta Tags are simple HTML tags that control access to the content and links on a webpage. Robots Meta Tags are inserted into the Head tag of the webpage. Many search engines, including Google, honor and follow the rules set up in the Robots Meta Tags. If the page does not have a Robots Meta Tag then the content is indexed and the links are followed. Robots Meta Tag options are:
To allow the links on a page to be followed but to prevent the content from being indexed use...
    <META name="robots" content="noindex">
To allow the content to be indexed but to prevent the links from being followed use...
    <META name="robots" content="nofollow">
To prevent both indexing and following use...
    <META name="robots" content="nofollow,noindex">
To allow the content to be searched by keywords...
    <META name="keywords" content="oranges, lemons, limes">

Domain and URL Exclusion Rules
Domain and/or URL Exclusion are the two options available to the Search Administrators. To request that a specific page or domain be excluded from the search index please use the Request form to the left. From the outset the following rules have been set up for exclusion: