New features were deployed requiring a reconfiguration of our docs search index. The reconfiguration required re-indexing all of the documentation.
The improved search functionality required more CPU resources compared to the previous implementation. When the re-indexing was triggered, high CPU usage occurred causing a backlog of index writes and ultimately connection timeouts to the search service. Automatic retries of index record insertions further compounded the issue.
After cancelling all the search index tasks, to avoid the automatic retry, we were then able to increase our search service capacity. With the capacity increased, we were able to attempt the indexing operation again.
We shall review the incident and make changes to reduce the risk of resource exhaustion if we need to rebuild our search indexes.