Case Studies
CSI's Paladin Monitoring Saves One Client From a Major Email Outage and Allows Us to Proactively Work on Another's Email Outage Issue
CSI's Paladin Remote Monitoring solution uncovered two major Exchange 2010 email crises in the span of an hour on Wednesday. One of the clients has just under 700 users; the other has about 1,350 users.
In the first incident, Paladin gave us an Exchange alert called "back pressure". This is where Exchange believes it will be unable to do its job, based on the rate at which it is consuming server resources (RAM and disk space). Exchange then tries to protect its core. The first thing it generally does is shut down email flow into and out of the Exchange server. This raises the questions: "How do you know that email sent to you was never received? How do you know whether email you sent was never delivered?" Paladin knows. Since CSI actively watches the alert consoles and doesn't rely solely on automated alerting to our clients, we went "old school", picked up the phone and talked to the appropriate person, who didn't know they were not getting email.
We worked with them to resolve the issue. After a simple resource allocation change to their virtual environment and a quick reboot, those 700 users continued to do what they do without worrying about, "Why is email down?"
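The kind of resource check behind a back pressure alert can be sketched roughly as follows. This is only an illustration: the threshold values, the `ResourceSample` type and the function names are hypothetical, and they are neither Exchange's actual back pressure settings nor Paladin's implementation.

```python
from dataclasses import dataclass
from typing import List

# Hypothetical thresholds, chosen for illustration only.
# They are NOT Exchange 2010's real back pressure limits.
MEMORY_USED_HIGH_PCT = 90.0
DISK_FREE_LOW_PCT = 10.0


@dataclass
class ResourceSample:
    """One reading of the two resources named in the case study."""
    memory_used_pct: float
    disk_free_pct: float


def pressure_reasons(sample: ResourceSample) -> List[str]:
    """Return the resources that are past their (assumed) thresholds."""
    reasons = []
    if sample.memory_used_pct >= MEMORY_USED_HIGH_PCT:
        reasons.append("memory")
    if sample.disk_free_pct <= DISK_FREE_LOW_PCT:
        reasons.append("disk")
    return reasons


def should_alert(sample: ResourceSample) -> bool:
    """Raise an alert if any monitored resource is under pressure."""
    return bool(pressure_reasons(sample))
```

A reading such as `ResourceSample(memory_used_pct=95.0, disk_free_pct=40.0)` would flag memory only, which matches the situation where a resource allocation change to the virtual machine is the fix.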
When we combine Paladin Monitoring with Paladin Mail Defense, we can do even better. Paladin Mail Defense provides us with 24x7x365 SMS text alerts when mail flow into and out of an email server stops and starts. If the outage is due to a true disaster situation, Paladin Mail Defense immediately switches into a disaster recovery mode in which the client's inbound email that cannot be delivered to their mail server is immediately available via the web. The mail server may be dead or the building destroyed, but if you can find somewhere with internet access, you are still able to send and receive critical email until whatever bad happened is resolved. If the situation is temporary, Paladin Mail Defense will simply restart the inbound and outbound mail flow automatically as soon as the connection is re-established, and then notify everyone via SMS that normal mail flow is working again.
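The second Exchange incident occurred an hour after the first. Unfortunately, an Exchange server providing access to approximately 1,350 users developed a high CPU condition. This caused degraded performance for the users. There was no warning. One minute everything was fine. The next minute it was in a bad place. Paladin alerted us. We were already looking into the outage when the phone rang with the customer reporting strange performance issues in Exchange. In this case, we could not prevent the performance degradation. No one can do that all of the time. However, we knew there was an urgent problem before our client did. We worked aggressively to resolve the issue as quickly as possible and to minimize downtime. About 20 minutes after the event started, we had it resolved and everyone went back to work. The response time from the alert to our emergency response was about three minutes.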
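It is impossible for you to know everything that is happening, or about to happen, on your network. By layering on 24x7x365 Paladin Remote Monitoring, we can provide you with the ability to know things about your network that are impossible to know on your own. By layering on Paladin Mail Defense, we can provide an added layer of disaster recovery protection for your critical email communications. How do you know what you don't know about your network?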
CSI's Paladin Monitoring Saves Another Client From Excessive Downtime
CSI's Paladin Remote Monitoring solution had an impressive save in the past few days.
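Last week an ISP went on-site after hours to do a routine hardware upgrade/swap. The outage was planned and expected. It was meant to be a quick in, out and back online. Paladin saw the client's site go offline (as planned). However, the site never came back. Time passed and it still had not come back. Hours passed. It was clear that something had gone terribly wrong. If this continued into the morning, our client would suffer. There were 2,100 users sitting behind this one connection, many of whom would be quite angry if this wasn't resolved. We placed the appropriate after-hours calls to the appropriate people, and around 10:45pm the ISP revisited the client and quickly resolved the connectivity issues created by their upgrades. The end users never even knew an outage had occurred. The people responsible for that site knew because Paladin was watching that location 24x7x365, whether or not they were standing there. We knew not to rely solely on an automated "you are down" alert, because we try very hard to have interactive discussions with our clients and go the extra mile in trying to keep them healthy. In this case, that meant after-hours, live "human" monitoring, just to make sure everything went smoothly. It is impossible for you to know everything that is happening, or about to happen, on your network.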
By layering on 24x7x365 Paladin Remote Monitoring, we can provide you with the ability to know things about your network that are impossible to know on your own. There is too much data to sift through. In both of these instances we were able to uncover substantial issues and deal with them before they became a major crisis with lots of unhappy users.
How do you know what you don't know about your network?
CSI Monitors Our Clients' Networks Through Hurricane Irene
As Hurricane Irene approached New York, CSI used our 24x7x365 Paladin Monitoring service to help our clients prepare their computers and networks for the impending hurricane. We were able to quickly identify all the uninterruptible power supplies (aka batteries) under management that had bad batteries or other hardware issues. Equipment plugged into these battery units had a greater than normal exposure to power fluctuations.
One client site intended to shut down all of its operations during the storm. Before they shut their equipment down, we identified a server that was compromised, with bad drives in a RAID array and other hardware issues. Our concern was that since this critical server already had a failed redundant component plus other issues, it might be shut down and never come back online.
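Realizing that time was critical to repairing the server, we used Paladin's remote management tools to reach into the server remotely at 12am Saturday, as the storm approached, to rebuild the redundant drive and re-establish full redundancy before the server was actually shut down. The client never had to get out of bed. No one had to show up to let us into the building, turn off the alarm and unlock the multiple doors needed to get into the server closet. After our work was done, the server went down as the client had planned and came back up normally after the storm.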
During the storm we proactively monitored our customers' networks and provided active status updates as we saw buildings and servers go down due to power failures throughout the region. By looking at the previous alerts and querying the power supplies, we were able to tell the difference between "no power" and actual equipment failures.
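The distinction between "no power" and an equipment failure can be sketched as a small decision rule over a device's reachability and the last status reported by the power supply feeding it. The status strings and the `classify_outage` function below are hypothetical and are only meant to illustrate the reasoning, not how Paladin does it.

```python
from enum import Enum
from typing import Optional


class Diagnosis(Enum):
    HEALTHY = "healthy"
    NO_POWER = "no power"
    EQUIPMENT_FAILURE = "possible equipment failure"
    UNKNOWN = "unknown"


def classify_outage(device_reachable: bool, ups_status: Optional[str]) -> Diagnosis:
    """Classify a device using the last known status of its power supply.

    ups_status is an assumed label: "on_mains", "on_battery",
    "battery_depleted", or None when no status was ever received.
    """
    if device_reachable:
        return Diagnosis.HEALTHY
    # The power supply reported losing utility power before the device
    # went quiet, so the outage is most likely just a power loss.
    if ups_status in ("on_battery", "battery_depleted"):
        return Diagnosis.NO_POWER
    # Utility power is present but the device is silent: worth a closer look.
    if ups_status == "on_mains":
        return Diagnosis.EQUIPMENT_FAILURE
    return Diagnosis.UNKNOWN
```

For example, a silent server whose power supply last reported `"on_battery"` would be classified as `NO_POWER`, while the same server behind a supply reporting `"on_mains"` would be flagged for investigation.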
Once the storm subsided on Sunday, we were able to pinpoint exactly which buildings were offline around the region. Then, as those buildings came back online, we were able to pinpoint exactly which equipment inside each building did not turn back on. From there we had a list of devices for either the client's technical staff or CSI's staff to investigate.
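Building that list of devices amounts to comparing what is expected in each building against what is responding again. A minimal sketch, assuming a hypothetical inventory keyed by building name and a set of device names that currently answer checks:

```python
from typing import Dict, List, Set


def devices_still_down(
    expected: Dict[str, List[str]], responding: Set[str]
) -> Dict[str, List[str]]:
    """Return, per building, the expected devices that are not responding.

    Buildings where every device is back are left out of the result.
    """
    follow_up = {}
    for building, devices in expected.items():
        missing = sorted(set(devices) - responding)
        if missing:
            follow_up[building] = missing
    return follow_up


# Hypothetical inventory and check results, for illustration only.
inventory = {
    "Building A": ["switch-a1", "server-a1"],
    "Building B": ["switch-b1", "ups-b1"],
}
back_online = {"switch-a1", "server-a1", "switch-b1"}
print(devices_still_down(inventory, back_online))
```

With the sample data above, only `ups-b1` in Building B remains on the follow-up list.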
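On Sunday night I was personally watching over our clients' networks via the Paladin monitoring console. In the middle of this, the power went out at my house. I walked outside and started the generator. Then I turned on the Verizon wireless card in my laptop and didn't miss a thing.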
CSI's office has an ample standby generator of its own and an excellent internet connection, so our 24x7x365 monitoring continued regardless of the storm conditions.
Once Monday morning came, despite flooding, road closures and massive power outages in some areas, most of our clients went back to work with their computer networks operating much like they did on Friday when they left for the weekend.
That is what CSI's Paladin Monitoring does.