Skip to content

Commit fc9d455

Browse files
author
Eric Sethna
authored
Merge pull request #609 from mattermost/sso-failover
SSO failover
2 parents bdf28cd + 9183833 commit fc9d455

File tree

1 file changed

+50
-1
lines changed

1 file changed

+50
-1
lines changed

source/administration/backup.rst

Lines changed: 50 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -28,7 +28,7 @@ To backup your Mattermost server:
2828
To restore a Mattermost instance from backup, restore your database, ``config.json`` file and optionally locally stored user files into the locations from which they were backed up.
2929

3030
Disaster Recovery
31-
-----------------
31+
---------------------------
3232

3333
An appropriate disaster recovery plan weighs the benefits of mitigating specific risks against the cost and complexity of setting up disaster recovery infrastructure and automation.
3434

@@ -59,3 +59,52 @@ A properly deployed high availability setup automatically switches over to a red
5959

6060
A "complete" disaster recovery solution would protect against both real-time hardware failures using high availability, data corruption failures using automated, and failures of the primary data center by offering both offsite backup and offsite redundant infrastructure. Because the complexity of a full disaster recovery solution is high, it is common for customers to consider trade-offs in cost and complexity relative to the anticipated risks and target recovery times.
6161

62+
Failover from single sign-in outage
63+
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
64+
65+
When using single sign-on with Mattermost Enterprise Edition an outage to your SSO provider can cause a partial outage on your Mattermost instance.
66+
67+
**What happens during an SSO outage?**
68+
69+
_Most people can still log in_ - By default, when a user logs in to Mattermost they receive a session token lasting 30 days (the duration can be configured in the System Console). During an SSO outage, users with valid session tokens can continue to using Mattermost uninterrupted.
70+
71+
_Some people can't log in_ - During an SSO outage, there are two situations under which a user cannot log in:
72+
73+
a) Users whose session token expires during the outage
74+
75+
b) Users trying to log in to new devices
76+
77+
In each case, the user cannot reach the SSO provider, and cannot log in. In this case, there are several potential mitigations:
78+
79+
1) Configure your SSO provider for high availability
80+
81+
If you're using a self-hosted single sign-on provider, several options are available for `high availability configurations that protect your system from unplanned outages. <https://docs.microsoft.com/en-us/microsoft-identity-manager/pam/high-availability-disaster-recovery-considerations-bastion-environment>`_
82+
83+
For SaaS-based authentication providers, while you still have a dependency on service uptime, you can set up redundancy in source systems from which data is being pulled. For example, with the OneLogin SaaS-based authentication service, you can set up `high availability LDAP connectivity <https://support.onelogin.com/hc/en-us/articles/204262680-High-Availability-for-LDAP>`_ to further reduce the chances of an outage.
84+
85+
2) Set up your own IDP to provide an automated or manual SSO failover option
86+
87+
Create a custom Identity Provider for SAML authentication that connects to both an active and a standby authentication option, that can be manually or automatically switched in case of an outage.
88+
89+
In this configuration, security should be carefully reviewed to prevent the standby SSO option from weakening your authentication protocols.
90+
91+
3) Set up a manual failover plan for SSO outages
92+
93+
When users are unable to reach your organization's SSO provider during an outage, an error message informing the users to contact your support link (defined in your System Console settings) is displayed.
94+
95+
Once IT is contacted about an SSO outage issue, they can temporarily change a user's account from SSO to email-password using the System Console, and the end user can use password to claim the account, until the SSO outage is over and the account can be converted back to SSO.
96+
97+
If the administrator is unable to log into the System Console because of the SSO outage, they can switch their authentication method to email-password to gain access using the `command line tool <https://docs.mattermost.com/administration/command-line-tools.html>`_.
98+
99+
It is highly important after the outage to switch everyone back to SSO from email-password to maintain consistency and security.
100+
101+
102+
103+
104+
105+
106+
107+
108+
109+
110+

0 commit comments

Comments
 (0)