How to configure search and promote for intranet site which requires login ? | Community
Skip to main content
Hemant_arora
New Participant
April 17, 2017
Solved

How to configure search and promote for intranet site which requires login ?

  • April 17, 2017
  • 2 replies
  • 1486 views

We need to configure search and promote that it crawls the intranet pages also which requires login.

We have currently implemented SAML for the intranet site and using S&P for unauthenticated users.

We want S&P to work for authenticated intranet users also and crawl intranet pages too..

Can you please suggest any approach ?

This post is no longer active and is closed to new replies. Need help? Start a new post to ask your question.
Best answer by edubey

 Regarding Intranet, 

1. S&P is a SaaS only solution

2. The challenge with crawling intranet sites is the crawler cannot access them as they typically are not exposed to the internet.

3. Usually what has to happen is either by IP or some domain name S&P can find the right external server.  By IP the S&P crawlers are whitelested. That would mean the intranet servers are not on internal servers but in the DMZ but have access locked down. They can also feed the data to S&P, not actually crawl the content. The feed is usually an XML export of the content for indexing.

Regarding Login, 

There are projects where S&P uses user credentials for crawling.

I hope it helps, I am not an S&P expert but got this info from internal experts.

thanks

2 replies

edubey
edubeyAccepted solution
New Participant
April 21, 2017

 Regarding Intranet, 

1. S&P is a SaaS only solution

2. The challenge with crawling intranet sites is the crawler cannot access them as they typically are not exposed to the internet.

3. Usually what has to happen is either by IP or some domain name S&P can find the right external server.  By IP the S&P crawlers are whitelested. That would mean the intranet servers are not on internal servers but in the DMZ but have access locked down. They can also feed the data to S&P, not actually crawl the content. The feed is usually an XML export of the content for indexing.

Regarding Login, 

There are projects where S&P uses user credentials for crawling.

I hope it helps, I am not an S&P expert but got this info from internal experts.

thanks

MC_Stuff
New Participant
April 17, 2017

Hi Hemant,

  AFAIK this needs custom implementation & not available out of the box. Need to engage with profession service.

Thanks,