Jump to content

Deen

Hornbill Staff
  • Posts

    783
  • Joined

  • Last visited

  • Days Won

    30

Posts posted by Deen

  1. For anyone affected, below is the root cause analysis for this disruption:

    On the morning of the 29/11 at approximately 09:15 a number of customers began reporting that their Hornbill instances were unavailable.  

    After investigating the issue it was determined that there was a problem with the deployment of the latest Core UI framework release that caused some files from the previous version to be served along side the new release, this then lead to the old files being incorrectly cached in Cloudflare alongside files in the new release.  An incompatablity between these releases lead to errors when trying to load the framework.
      
    To fix this we deployed a patch by 09:40 to break the cache again so that all requests would get the correct version. We are planning to make changes to the release process to prevent this happening again which will include purging the Cloudflare cache after deployment.

    We apologise for the inconvenience this will no doubt have caused.

    • Thanks 1
  2. After additional tests this appears to be an issue with Office365 where intermittently it is unable to complete a valid connection.  Any emails stuck in your Outbox will have 9 retry attempts, if any of these are successful they will leave the mailbox as expected - you will see from your Sent Items folder that emails are sending, it is a few specific ones that fail.  You can also manually resend affected mails using the envelope icon if they are time sensitive.

    We have had reports from a number of other Customers with the same issue, so there does seem to be a problem with Office365 mail at the moment.

  3. All, we will be performing an update at 11.30am which should correct all the issues that have been reported this morning.  However this will result in a temporary outage of a couple of minutes.  We apologise for the inconvenience this will cause.

  4. @RobW @Dan Munns @HGrigsby This is a change that is covered in the latest ESP release notes:
     

    • Stops the system from processing the reports further when the maximum is reached and to return an error report as the output. Updated the HTML templates for the reports so that the error message can be produced and displayed.

    Now I believe that previously these reports would fail silently, if you correct the column count to match the configured limit there shouldn't be a problem.

  5. @Darren Rose I take it we are talking about widgets here rather than actual reports?  We have looked at some of the heavier ones and the largest contains the following in the query:

    h_itsm_requests.h_datelogged >= '2019-04-30 00:00:00'

    Is there a need for the query to bring back requests from early 2019?  If this can be reduced to a weekly or monthly value that should significantly reduce the query time.  Also looking at the query two of the fields h_fk_servicename and h_catalog don't have indexes, you could swap them with h_fk_serviceid and h_catalog_id as that should also help to speed up the query and get it under the current threshold.

×
×
  • Create New...