Search Center issue

  • George2

    I am using SharePoint Server 2007 on Windows Server 2008, and I am using Search Center to crawl a web data source (i.e. pages from other web sites). My question is about the crawled-page counters displayed on the log page for the web data source in Search Center.

    Three crawl counters are displayed: success, failure, and warning. Can a counter's value include duplicate URLs? For example, for the web data source www.mysite.com the log reports 1000 pages crawled successfully, 10 failed, and no warnings. Does that mean 1000 distinct web pages are stored in Search Center, or could the 1000 counted pages include duplicated URLs?

    BTW: I am confused because I have scheduled a daily incremental crawl. For example, if http://www.mysite.com/1.html is crawled both yesterday and today (successfully both times), will it be counted twice? I would appreciate it if anyone could point me to documentation on what the counters mean.

    Thanks in advance, George

  • When you crawl a regular website, the crawler follows each of the links. It shouldn't duplicate pages, but it will see references to the same page (the home page, for example) many times. To determine the actual number of pages or items, look at the Items in Index count rather than the number of items crawled.
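
The distinction in the answer above can be illustrated with a toy model. This is only a sketch and not SharePoint's crawler: the `crawl` function, the `SITE` link graph, and the `index` set are all hypothetical names invented for the illustration. It assumes a crawler that tracks visited URLs, so a page referenced by many links is still fetched once per crawl, and an index keyed by URL, so re-crawling the same page on a later day does not add a new item.

```python
from collections import deque


def crawl(start, links):
    """Breadth-first crawl over a toy link graph.

    Returns (fetched_pages, link_references), where fetched_pages lists
    each URL once and link_references counts every link encountered.
    """
    seen = {start}
    queue = deque([start])
    fetched = []
    references = 0
    while queue:
        url = queue.popleft()
        fetched.append(url)
        for target in links.get(url, []):
            references += 1          # every link is a reference...
            if target not in seen:   # ...but each URL is queued only once
                seen.add(target)
                queue.append(target)
    return fetched, references


# Hypothetical three-page site; every page links back to the home page.
SITE = {
    "/": ["/1.html", "/2.html"],
    "/1.html": ["/", "/2.html"],
    "/2.html": ["/"],
}

index = set()  # stands in for the search index, keyed by URL
for day in ("yesterday", "today"):
    fetched, references = crawl("/", SITE)
    index.update(fetched)  # re-crawled URLs replace, not duplicate
    print(day, "pages fetched:", len(fetched), "link references:", references)

print("items in index:", len(index))
```

In this model the number of link references is larger than the number of pages fetched, and crawling on two days still leaves one index entry per URL, which is why the Items in Index count is the more reliable figure for distinct pages.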

Tags
2007 windows-server search
