
Deen

Hornbill Staff
  • Posts: 782
  • Joined
  • Last visited
  • Days Won: 30

Everything posted by Deen

  1. @Adam@Greggs @Alistair Young Our developers have found the root cause of the problem and are working on a fix. I'll let you know when a patch is deployed.
  2. It appears the patch may not have completely resolved the issue. Our developers are still looking at this and another fix should be pushed out shortly.
  3. @JAquino There is possibly a defect here which will need to be corrected. I'll speak to our developers and will let you know our findings shortly.
  4. @Adrian Simpkins @HGrigsby Good to hear and apologies for the inconvenience.
  5. Hi All, Hornbill are aware of the issue and we are currently working on a solution. I'll post again shortly once I have an update on our progress.
  6. @Nikolaj @samwoo We are looking into this and are planning to patch this very shortly.
  7. @Mhari Lindsay I've just sent you a PM. I'd like to see this first hand if you could please.
  8. @Mhari Lindsay I take it the affected users have tried completely refreshing their browser cache, or tried an alternative browser?
  9. Apologies all, there was an issue that has now been corrected. I'll post further details on this shortly. Deen
  10. @chriscorcoran We are aware of the issue and are working on a solution. I'll have a further update for you shortly and you can also check the status here https://status.hornbill.com/ Deen
  11. Hi All, We are aware of an issue affecting a limited number of customer instances. Our Infrastructure team are working on a solution and I will advise when a fix is in place. Deen
  12. @ssimpson There may be an issue here introduced by last night's update. We are looking into this and I will post a follow-up shortly when we know more.
  13. All instances should now be back up and running. We apologise for the disruption caused here and I'll have a root cause analysis posted shortly.
  14. @jmcnae I've corrected the link. There is also https://status.hornbill.com/ that can be used to keep up to date with the outage.
  15. All, we are working on a solution and you can keep track of our progress here
  16. All, we are working on a solution and I'll have a further update as soon as possible.
  17. A limited number of instances were affected by this issue but they should all be back online. We apologise for this temporary outage and we will have a root cause available shortly.
  18. All, a patch has now been released to all instances and the issue should be resolved. Let me know if any problems remain and apologies for the inconvenience caused here.
  19. Hi all, apologies for the disruption caused here, we are looking into a solution and I will have a further update shortly.
  20. @Mark (ESC) @JanS2000 @Bev Williams A patch has now been released to all instances, you may need to flush your browser cache for the fix to take effect. Please let me know if there are any issues.
  21. A patch is currently in the works, I'll have another update shortly.
  22. @Mark (ESC) Our developers are aware of the issue and are working on a fix, I'll have a further update for you shortly.
  23. Apologies to all affected by the outage, affected instances should be coming back online now. I will have a further update shortly with the root cause.
  24. For anyone affected, below is the root cause analysis for this disruption:

      At 11:18 our monitoring detected unusual disk activity and high RAM usage on one of our nodes (mdh-p01-node16). This node provides core services and file attachments (not database records) to 5 of our customers. Investigation showed that the high RAM usage was a direct result of slow disk access, and all actions were therefore being affected. No obvious cause could be found; all underlying hardware was checked and no errors were detected.

      At 11:39 a controlled shutdown of processes was started to try to identify which process was causing the unexpected disk usage. It became clear that even with nearly all services stopped (except core Windows functionality) the issue persisted, and all attempts to recover the node were unsuccessful. Recovery was attempted for 30 minutes and the DR plan was then initiated to restore the affected customer instances to other nodes. As the timeline shows, all of our targets were achieved in less than half the RTO time (DR plan invoked after 30 minutes, emergency level of service within 2 hours, and restoration of key services within 5 hours).

      Timeline
      11:18 - Alerted by monitoring of high RAM; investigation began
      11:19 - Notified HSML of high RAM; instances and customers still available
      11:20 - Investigation of root cause
      11:30 - Xen toolstack restart
      11:39 - Controlled shutdown of non-critical processes within the VM
      11:45 - Controlled shutdown of ESP services
      11:50 - Instances and customers start to become unavailable
      11:55 - All instances on the node unavailable
      12:01 - Attempted to restart and correct Windows
      12:02 - Informed HSML of the Windows restart
      12:05 - Windows unavailable; failed to restart
      12:15 - Continued Windows recovery; unable to progress
      12:24 - Started DR planning
      12:25 - Decision to build a new node16 and migrate existing instances to node17/18
      14:02 - All affected instances back up and running
      16:52 - File restore 100% complete

      Total non-ER downtime (instance unavailable): from 1 hour 7 minutes to 2 hours. Total recovery: from 2 hours to 4 hours 52 minutes.

      Root Cause
      Due to the nature of the issue and the loss of any diagnostic logs inside the VM, which we can no longer access, we are unsure of the root cause. However, the pattern of issues suggests a problem or corruption with the virtual disk containing the Windows system. Because the drives are encrypted, even a small corruption would cause large problems.

      Further Planned/Required Action
      - Rebuild node16 and rebalance clusters: the new node already exists and the rebalance will be performed over the next few weeks.
      - Investigate the original failure: this will continue. We will not only investigate the root cause (although, as noted above, the most useful logs are inside the VM, so we do not expect a definitive answer), we will also attempt to recover the node and its data in the hope of finding a way to recover more quickly should a similar issue occur in future.
      - Storage servers: already planned (the hardware is in place and initial code changes have been made) is a move away from keeping storage (for file attachments etc.) local to the node, to dedicated storage servers acting in replicated pairs. With these in place, losing a node would not require restoring data from backup servers; once an instance was recreated, everything would be available immediately rather than after 2-4 hours.
      - Microservices: already planned, with code changes made over the last 2 years to support this. Together with the storage servers, the use of microservices removes the need for a "home" node, so any node can service any instance. With this in place, the loss of a node is no concern (and in fact part of the normal routine).

      We have never previously had a corruption like this and believe the chances of another occurrence are low. We apologise for the disruption caused by the failure. (An illustrative sketch of the kind of monitoring check that raised the initial alert appears after the final post below.)
  25. For anyone affected, below is the root cause analysis for this disruption: On the morning of 29/11 at approximately 09:15, a number of customers began reporting that their Hornbill instances were unavailable. After investigating the issue it was determined that there was a problem with the deployment of the latest Core UI framework release which caused some files from the previous version to be served alongside the new release; these old files were then incorrectly cached in Cloudflare alongside files from the new release. An incompatibility between the two releases led to errors when trying to load the framework. To fix this we deployed a patch by 09:40 to break the cache so that all requests would receive the correct version. We are planning to make changes to the release process to prevent this happening again, which will include purging the Cloudflare cache after deployment (an illustrative sketch of such a post-deployment purge follows below). We apologise for the inconvenience this will no doubt have caused.
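The first root cause analysis above (post 24) begins with monitoring detecting high RAM usage and slow disk access on a node. As a rough illustration only, the sketch below shows the kind of host-level check that could raise such an alert; the thresholds, check interval, psutil-based sampling and print-based alert hook are all assumptions for the example, not Hornbill's actual monitoring.

```python
# Illustrative sketch only: thresholds, interval and alerting are assumptions,
# not Hornbill's real monitoring stack.
import time
import psutil

RAM_ALERT_PERCENT = 90    # assumed threshold for a "high RAM" alert
DISK_LATENCY_MS = 200     # assumed threshold for "slow disk access"
CHECK_INTERVAL_S = 60


def disk_latency_ms(prev, curr):
    """Rough average milliseconds per I/O between two disk_io_counters() samples."""
    ops = (curr.read_count + curr.write_count) - (prev.read_count + prev.write_count)
    busy = (curr.read_time + curr.write_time) - (prev.read_time + prev.write_time)
    return busy / ops if ops else 0.0


def raise_alert(message):
    # Placeholder: a real system would page on-call or update a status feed.
    print(f"ALERT: {message}")


prev = psutil.disk_io_counters()
while True:
    time.sleep(CHECK_INTERVAL_S)
    curr = psutil.disk_io_counters()
    ram = psutil.virtual_memory().percent
    latency = disk_latency_ms(prev, curr)
    if ram > RAM_ALERT_PERCENT or latency > DISK_LATENCY_MS:
        raise_alert(f"RAM {ram:.0f}%, avg disk latency {latency:.0f} ms")
    prev = curr
```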
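The second analysis (post 25) says the release process will be changed to purge the Cloudflare cache after deployment. A minimal sketch of such a post-deploy step is shown below, using Cloudflare's zone purge_cache endpoint; the environment variable names and the choice to purge the whole zone (rather than individual URLs) are assumptions, and this is not Hornbill's actual release tooling.

```python
# Illustrative sketch only: a post-deploy step that purges the Cloudflare cache
# so stale framework files cannot be served alongside a new release.
import os
import requests

# Placeholders supplied by the deployment environment (assumed names).
CLOUDFLARE_ZONE_ID = os.environ["CLOUDFLARE_ZONE_ID"]
CLOUDFLARE_API_TOKEN = os.environ["CLOUDFLARE_API_TOKEN"]


def purge_cloudflare_cache():
    """Purge everything in the zone once the deployment has completed."""
    resp = requests.post(
        f"https://api.cloudflare.com/client/v4/zones/{CLOUDFLARE_ZONE_ID}/purge_cache",
        headers={"Authorization": f"Bearer {CLOUDFLARE_API_TOKEN}"},
        json={"purge_everything": True},
        timeout=30,
    )
    resp.raise_for_status()
    body = resp.json()
    if not body.get("success"):
        raise RuntimeError(f"Cache purge failed: {body.get('errors')}")


if __name__ == "__main__":
    purge_cloudflare_cache()
    print("Cloudflare cache purged after deployment")
```

Purging the whole zone is the simplest way to guarantee no mixed-version files remain cached; a more targeted alternative would be to purge only the framework's asset URLs.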