As you may be aware, UTLogin provides centralized authentication services for more than 250 campus applications and processes more than 55 million authentication requests annually. As with all Identity & Access Management (IAM) services, our goal is for UTLogin to be reliable, secure, and easy-to-use. In the past year, we have not met the reliability expectations of campus (and ourselves) as UTLogin has experienced a number of outages.
I apologize for the disruption these issues have caused you, your customers, and your business processes. We are committed to addressing the root causes of these issues and restoring UTLogin to stable operations.
Although the IAM team has implemented fixes and mitigations after each service outage, new issues with different immediate causes continue to appear, pointing to a deeper set of problems that need to be addressed. After analyzing the 20 UTLogin service incidents that occurred from June 2016 to May 2017, we believe that the overall instability issues are caused by a combination of three major factors:
- Customizations and Non-Standard Configuration – The OpenAM vendor product upon which UTLogin is based was heavily customized during implementation to meet unique UT Austin requirements, directly causing some issues and making diagnosis of other issues more difficult.
- Aging System Components – The software components of the UTLogin system are at or reaching end-of-life, limiting support options and making issues harder to diagnose and address.
- Changing Demands on UTLogin – The number and complexity of systems integrated with UTLogin have greatly increased, introducing new demands on the system.
The IAM team has developed a roadmap to address these issues and return UTLogin to stability:
- Action 1: Stabilize the Current Environment – Put the current system in “critical fix only” mode, strictly manage configuration changes, and stop unproductive investment of time chasing down issues in the current environment. (Status: Complete)
- Action 2: Simplify & Standardize UTLogin – Upgrade system components to current supported versions, remove customizations and non-standard configurations, minimize external dependencies, and review and simplify the authentication policy model. (Status: In progress. Expected completion for Requirements and Design: September 2017; Implementation timeline will depend on the Design.)
- Action 3: Measure & Report Progress – Monitor key performance indicators (KPIs) and report progress toward improving stability to UTLogin customers and stakeholders. (Status: In progress. Expected completion for KPI reporting: July 2017)
You can read the complete the UTLogin Stability Roadmap here: http://links.utexas.edu/byjfjw.
We appreciate your continued support as we work to keep the University’s online environment safe and secure. If you have any questions, please send them to firstname.lastname@example.org.
MARIO A. LEAL, Jr., Senior IT Manager
The University of Texas at Austin | ITS Applications | 512-471-6954 | utexas.edu