dbTalk Databases Forums  

what happens when the public connection is lost?

microsoft.public.sqlserver.clustering microsoft.public.sqlserver.clustering


Discuss what happens when the public connection is lost? in the microsoft.public.sqlserver.clustering forum.



Reply
 
Thread Tools Display Modes
  #1  
Old   
Hurme
 
Posts: n/a

Default what happens when the public connection is lost? - 02-05-2008 , 01:46 PM






A SQL 2000 db on W2K3 cluster with 2 nodes lost it's public connection over
the weekend. Both VIPs were unpingable. I was able to rdc directly to each
of the nodes on monday and restart the cluster admin service. Once this was
done the VIPs were reachable again.

I'm trying to understand what happened here. There are messages in the
event log that show that Node1 lost it's public connection and then tried to
fail-over to node2. Node2 could not communicate won the public connection so
it tried to fail-ove to node1. This flip-flopping happened a few more times
before another message appeared indicating the cluster could once again
communicate on the public conn. However, now of the apps were able to
connect to the db.

After a certain number or trys, did both of the nodes eventually give up and
remove themselves from the cluster thus requiring the restart of the cluster
services?
--
MG

Reply With Quote
  #2  
Old   
Geoff N. Hiten
 
Posts: n/a

Default Re: what happens when the public connection is lost? - 02-05-2008 , 02:24 PM






What happened is an underlying resource failed so SQL tried to find a node
where the resource existed. Eventually it would have exhausted the retry
count and "given up". IF SQL fails to start or restart a given number of
times within a certain time period (both adjustable using the cluster tool),
the clustered resource will be considered unrecoverable and will require
manual restarting.

--
Geoff N. Hiten
Senior SQL Infrastructure Consultant
Microsoft SQL Server MVP




"Hurme" <michael.geles (AT) thomson (DOT) com> wrote

Quote:
A SQL 2000 db on W2K3 cluster with 2 nodes lost it's public connection over
the weekend. Both VIPs were unpingable. I was able to rdc directly to
each
of the nodes on monday and restart the cluster admin service. Once this
was
done the VIPs were reachable again.

I'm trying to understand what happened here. There are messages in the
event log that show that Node1 lost it's public connection and then tried
to
fail-over to node2. Node2 could not communicate won the public connection
so
it tried to fail-ove to node1. This flip-flopping happened a few more
times
before another message appeared indicating the cluster could once again
communicate on the public conn. However, now of the apps were able to
connect to the db.

After a certain number or trys, did both of the nodes eventually give up
and
remove themselves from the cluster thus requiring the restart of the
cluster
services?
--
MG


Reply With Quote
  #3  
Old   
Geoff N. Hiten
 
Posts: n/a

Default Re: what happens when the public connection is lost? - 02-05-2008 , 02:24 PM



What happened is an underlying resource failed so SQL tried to find a node
where the resource existed. Eventually it would have exhausted the retry
count and "given up". IF SQL fails to start or restart a given number of
times within a certain time period (both adjustable using the cluster tool),
the clustered resource will be considered unrecoverable and will require
manual restarting.

--
Geoff N. Hiten
Senior SQL Infrastructure Consultant
Microsoft SQL Server MVP




"Hurme" <michael.geles (AT) thomson (DOT) com> wrote

Quote:
A SQL 2000 db on W2K3 cluster with 2 nodes lost it's public connection over
the weekend. Both VIPs were unpingable. I was able to rdc directly to
each
of the nodes on monday and restart the cluster admin service. Once this
was
done the VIPs were reachable again.

I'm trying to understand what happened here. There are messages in the
event log that show that Node1 lost it's public connection and then tried
to
fail-over to node2. Node2 could not communicate won the public connection
so
it tried to fail-ove to node1. This flip-flopping happened a few more
times
before another message appeared indicating the cluster could once again
communicate on the public conn. However, now of the apps were able to
connect to the db.

After a certain number or trys, did both of the nodes eventually give up
and
remove themselves from the cluster thus requiring the restart of the
cluster
services?
--
MG


Reply With Quote
  #4  
Old   
Geoff N. Hiten
 
Posts: n/a

Default Re: what happens when the public connection is lost? - 02-05-2008 , 02:24 PM



What happened is an underlying resource failed so SQL tried to find a node
where the resource existed. Eventually it would have exhausted the retry
count and "given up". IF SQL fails to start or restart a given number of
times within a certain time period (both adjustable using the cluster tool),
the clustered resource will be considered unrecoverable and will require
manual restarting.

--
Geoff N. Hiten
Senior SQL Infrastructure Consultant
Microsoft SQL Server MVP




"Hurme" <michael.geles (AT) thomson (DOT) com> wrote

Quote:
A SQL 2000 db on W2K3 cluster with 2 nodes lost it's public connection over
the weekend. Both VIPs were unpingable. I was able to rdc directly to
each
of the nodes on monday and restart the cluster admin service. Once this
was
done the VIPs were reachable again.

I'm trying to understand what happened here. There are messages in the
event log that show that Node1 lost it's public connection and then tried
to
fail-over to node2. Node2 could not communicate won the public connection
so
it tried to fail-ove to node1. This flip-flopping happened a few more
times
before another message appeared indicating the cluster could once again
communicate on the public conn. However, now of the apps were able to
connect to the db.

After a certain number or trys, did both of the nodes eventually give up
and
remove themselves from the cluster thus requiring the restart of the
cluster
services?
--
MG


Reply With Quote
  #5  
Old   
Geoff N. Hiten
 
Posts: n/a

Default Re: what happens when the public connection is lost? - 02-05-2008 , 02:24 PM



What happened is an underlying resource failed so SQL tried to find a node
where the resource existed. Eventually it would have exhausted the retry
count and "given up". IF SQL fails to start or restart a given number of
times within a certain time period (both adjustable using the cluster tool),
the clustered resource will be considered unrecoverable and will require
manual restarting.

--
Geoff N. Hiten
Senior SQL Infrastructure Consultant
Microsoft SQL Server MVP




"Hurme" <michael.geles (AT) thomson (DOT) com> wrote

Quote:
A SQL 2000 db on W2K3 cluster with 2 nodes lost it's public connection over
the weekend. Both VIPs were unpingable. I was able to rdc directly to
each
of the nodes on monday and restart the cluster admin service. Once this
was
done the VIPs were reachable again.

I'm trying to understand what happened here. There are messages in the
event log that show that Node1 lost it's public connection and then tried
to
fail-over to node2. Node2 could not communicate won the public connection
so
it tried to fail-ove to node1. This flip-flopping happened a few more
times
before another message appeared indicating the cluster could once again
communicate on the public conn. However, now of the apps were able to
connect to the db.

After a certain number or trys, did both of the nodes eventually give up
and
remove themselves from the cluster thus requiring the restart of the
cluster
services?
--
MG


Reply With Quote
  #6  
Old   
Geoff N. Hiten
 
Posts: n/a

Default Re: what happens when the public connection is lost? - 02-05-2008 , 02:24 PM



What happened is an underlying resource failed so SQL tried to find a node
where the resource existed. Eventually it would have exhausted the retry
count and "given up". IF SQL fails to start or restart a given number of
times within a certain time period (both adjustable using the cluster tool),
the clustered resource will be considered unrecoverable and will require
manual restarting.

--
Geoff N. Hiten
Senior SQL Infrastructure Consultant
Microsoft SQL Server MVP




"Hurme" <michael.geles (AT) thomson (DOT) com> wrote

Quote:
A SQL 2000 db on W2K3 cluster with 2 nodes lost it's public connection over
the weekend. Both VIPs were unpingable. I was able to rdc directly to
each
of the nodes on monday and restart the cluster admin service. Once this
was
done the VIPs were reachable again.

I'm trying to understand what happened here. There are messages in the
event log that show that Node1 lost it's public connection and then tried
to
fail-over to node2. Node2 could not communicate won the public connection
so
it tried to fail-ove to node1. This flip-flopping happened a few more
times
before another message appeared indicating the cluster could once again
communicate on the public conn. However, now of the apps were able to
connect to the db.

After a certain number or trys, did both of the nodes eventually give up
and
remove themselves from the cluster thus requiring the restart of the
cluster
services?
--
MG


Reply With Quote
  #7  
Old   
Geoff N. Hiten
 
Posts: n/a

Default Re: what happens when the public connection is lost? - 02-05-2008 , 02:24 PM



What happened is an underlying resource failed so SQL tried to find a node
where the resource existed. Eventually it would have exhausted the retry
count and "given up". IF SQL fails to start or restart a given number of
times within a certain time period (both adjustable using the cluster tool),
the clustered resource will be considered unrecoverable and will require
manual restarting.

--
Geoff N. Hiten
Senior SQL Infrastructure Consultant
Microsoft SQL Server MVP




"Hurme" <michael.geles (AT) thomson (DOT) com> wrote

Quote:
A SQL 2000 db on W2K3 cluster with 2 nodes lost it's public connection over
the weekend. Both VIPs were unpingable. I was able to rdc directly to
each
of the nodes on monday and restart the cluster admin service. Once this
was
done the VIPs were reachable again.

I'm trying to understand what happened here. There are messages in the
event log that show that Node1 lost it's public connection and then tried
to
fail-over to node2. Node2 could not communicate won the public connection
so
it tried to fail-ove to node1. This flip-flopping happened a few more
times
before another message appeared indicating the cluster could once again
communicate on the public conn. However, now of the apps were able to
connect to the db.

After a certain number or trys, did both of the nodes eventually give up
and
remove themselves from the cluster thus requiring the restart of the
cluster
services?
--
MG


Reply With Quote
  #8  
Old   
Geoff N. Hiten
 
Posts: n/a

Default Re: what happens when the public connection is lost? - 02-05-2008 , 02:24 PM



What happened is an underlying resource failed so SQL tried to find a node
where the resource existed. Eventually it would have exhausted the retry
count and "given up". IF SQL fails to start or restart a given number of
times within a certain time period (both adjustable using the cluster tool),
the clustered resource will be considered unrecoverable and will require
manual restarting.

--
Geoff N. Hiten
Senior SQL Infrastructure Consultant
Microsoft SQL Server MVP




"Hurme" <michael.geles (AT) thomson (DOT) com> wrote

Quote:
A SQL 2000 db on W2K3 cluster with 2 nodes lost it's public connection over
the weekend. Both VIPs were unpingable. I was able to rdc directly to
each
of the nodes on monday and restart the cluster admin service. Once this
was
done the VIPs were reachable again.

I'm trying to understand what happened here. There are messages in the
event log that show that Node1 lost it's public connection and then tried
to
fail-over to node2. Node2 could not communicate won the public connection
so
it tried to fail-ove to node1. This flip-flopping happened a few more
times
before another message appeared indicating the cluster could once again
communicate on the public conn. However, now of the apps were able to
connect to the db.

After a certain number or trys, did both of the nodes eventually give up
and
remove themselves from the cluster thus requiring the restart of the
cluster
services?
--
MG


Reply With Quote
  #9  
Old   
Geoff N. Hiten
 
Posts: n/a

Default Re: what happens when the public connection is lost? - 02-05-2008 , 02:24 PM



What happened is an underlying resource failed so SQL tried to find a node
where the resource existed. Eventually it would have exhausted the retry
count and "given up". IF SQL fails to start or restart a given number of
times within a certain time period (both adjustable using the cluster tool),
the clustered resource will be considered unrecoverable and will require
manual restarting.

--
Geoff N. Hiten
Senior SQL Infrastructure Consultant
Microsoft SQL Server MVP




"Hurme" <michael.geles (AT) thomson (DOT) com> wrote

Quote:
A SQL 2000 db on W2K3 cluster with 2 nodes lost it's public connection over
the weekend. Both VIPs were unpingable. I was able to rdc directly to
each
of the nodes on monday and restart the cluster admin service. Once this
was
done the VIPs were reachable again.

I'm trying to understand what happened here. There are messages in the
event log that show that Node1 lost it's public connection and then tried
to
fail-over to node2. Node2 could not communicate won the public connection
so
it tried to fail-ove to node1. This flip-flopping happened a few more
times
before another message appeared indicating the cluster could once again
communicate on the public conn. However, now of the apps were able to
connect to the db.

After a certain number or trys, did both of the nodes eventually give up
and
remove themselves from the cluster thus requiring the restart of the
cluster
services?
--
MG


Reply With Quote
  #10  
Old   
Geoff N. Hiten
 
Posts: n/a

Default Re: what happens when the public connection is lost? - 02-05-2008 , 02:24 PM



What happened is an underlying resource failed so SQL tried to find a node
where the resource existed. Eventually it would have exhausted the retry
count and "given up". IF SQL fails to start or restart a given number of
times within a certain time period (both adjustable using the cluster tool),
the clustered resource will be considered unrecoverable and will require
manual restarting.

--
Geoff N. Hiten
Senior SQL Infrastructure Consultant
Microsoft SQL Server MVP




"Hurme" <michael.geles (AT) thomson (DOT) com> wrote

Quote:
A SQL 2000 db on W2K3 cluster with 2 nodes lost it's public connection over
the weekend. Both VIPs were unpingable. I was able to rdc directly to
each
of the nodes on monday and restart the cluster admin service. Once this
was
done the VIPs were reachable again.

I'm trying to understand what happened here. There are messages in the
event log that show that Node1 lost it's public connection and then tried
to
fail-over to node2. Node2 could not communicate won the public connection
so
it tried to fail-ove to node1. This flip-flopping happened a few more
times
before another message appeared indicating the cluster could once again
communicate on the public conn. However, now of the apps were able to
connect to the db.

After a certain number or trys, did both of the nodes eventually give up
and
remove themselves from the cluster thus requiring the restart of the
cluster
services?
--
MG


Reply With Quote
Reply




Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off



Powered by vBulletin Version 3.5.3
Copyright ©2000 - 2012, Jelsoft Enterprises Ltd.