What is the best practice to implement search to index XML files?

Stef Heyenrath

    In our SharePoint solution we need a service which can search and index XML files.

    At the moment we are using FAST, but are there other tools, applications, or solutions that are free and easy to integrate and customize within SharePoint 2007?

2007 search fast-search crawling xml
Related questions and answers
  • I am using Microsoft Search Server 2008 (based on SharePoint Server 2007) + C# + .Net 3.5 + VSTS 2008 + ASP.Net to develop a web application which invokes Search Server 2008 Web Services when a button in the html page is pressed. I am using the following code to query content from Microsoft Search Server 2008. My question is how to display the search results from the DataSet retrieved? I did not find a very good sample from Google. protected void Button1_Click(object sender, EventArgs e) { //The string containing the keyword to use in the search string keywordString

  • than a tree. I've verified that it isn't the later steps of loading into a DataTable/obtaining XML/transforming which is the issue - it's coming back from the search API like this. Any ideas why...I'm writing some code against the search API in SharePoint 2007, and am seeing some interesting behaviour. I'm using the KeywordQuery 'model' rather than FullTextSqlQuery, since this matches... are in a different format, possibly due to an encoding issue somewhere. As an example, HitHighlightingSummary does not contain child XML elements as it does in an OOTB search - instead, there is one text

  • The index will be kept on the Index Server, so why do we need a Search database, and what is the point of it in SharePoint 2007? I would like to understand the basic concept of SharePoint Search. Below are a few questions: Why do we need a search DB? What is it used for? Why does it consume more DB space than other SharePoint DBs? When will it consume more CPU? What are the important tables to know? Do we need a separate DB instance for it? Please help me with your thoughts.

  • I am using MOSS 2007 to provide a custom search results page. I need to display a number with each search result that indicates the position of that result in the whole result set. The problem is that the XML returned by search does not contain data about the page (of results) that you are on. I can use the value of 'id', but this only works if I do not use paged results... into the search results to enable this?

  • Due to a business requirement, we would like to introduce a new WFE into the existing SharePoint 2007 farm, which already has 8 WFEs and separate servers for Index, Query, and admin. What would be the best approach to follow?

  • This is an old issue. I thought it was a bug in a specific environment, but reading a couple of blogs has made me think again... The issue is that SharePoint site collections and sites are not correctly picked up by the search index. All the content is indexed, but the sites themselves are not associated with the correct contentclass. Prior to SP2 the site collections (SPSite's) are listed...) but there is still no fix for 2007. I recently applied the April 2010 CU to a test farm, reset the search index, and recrawled, and the results are still missing sites and webs from publishing sites. No fix

  • I have a large number of PDFs and I need to be able to search them in MOSS 2007. I am aware of the iFilter that is required, but it will not index scanned PDFs. What is the fastest way to handle this problem?

  • I was looking for some benchmarks on the search capabilities of FAST Search Server to try and determine whether or not we need a single or multi-server environment for FAST Search Server. Do any exist? How big of an index have people seen before FAST Search starts to have issues serving queries or indexing? Thanks!

  • I'm getting the following error only when performing a search through the search center: "Your search cannot be completed because of a service error. Try your search again or contact your administrator for more information". Here's the link that is used to show the results page in an iFrame. I tried several things, such as resetting the crawled content, reconfiguring the Office Search Service, and re-associating the index server in the SSP. None of these worked. Any thoughts on this?
