- From other website, error seen is "This site can't be reached. The connection was reset." - Only one of the two presentation servers is displaying errors, the other server displays websites as expected.
Issue resolved after starting Discovery microservice on offending server, and IIS (alternatively, could have just started the applicable application pool). Likely issue was triggered because Windows updates were applied to server during non-business hours, and after restarting server the Discovery service had not restarted successfully. Some steps suggested as below to reduce likelihood of future outages or to recover from an incident, note these actions may vary from customer to customer and for different environments: - Change the startup type of services from Automatic to "Automatic (delayed)" for failing services. - Review the core log of failing service for indications of why the service had failed. - Monitor the microservice endpoints (possible in Tridion Sites 9.0 and later product versions only). For DXA websites, at minimum the following microservice endpoints should be monitored. http://SERVERNAME:PORT/content.svc/health http://SERVERNAME:PORT/discovery.svc/health http://SERVERNAME:PORT/context.svc/health