Data from Information Repositories

Sub-techniques (5)

ID	Name
T1213.001	Confluence
T1213.002	Sharepoint
T1213.003	Code Repositories
T1213.004	Customer Relationship Management Software
T1213.005	Messaging Applications

Adversaries may leverage information repositories to mine valuable information. Information repositories are tools that allow for storage of information, typically to facilitate collaboration or information sharing between users, and can store a wide variety of data that may aid adversaries in further objectives, such as Credential Access, Lateral Movement, or Defense Evasion, or direct access to the target information. Adversaries may also abuse external sharing features to share sensitive documents with recipients outside of the organization (i.e., Transfer Data to Cloud Account).

The following is a brief list of example information that may hold potential value to an adversary and may also be found on an information repository:

Policies, procedures, and standards
Physical / logical network diagrams
System architecture diagrams
Technical system documentation
Testing / development credentials (i.e., Unsecured Credentials)
Work / project schedules
Source code snippets
Links to network shares and other internal resources
Contact or other sensitive information about business partners and customers, including personally identifiable information (PII)

Information stored in a repository may vary based on the specific instance or environment. Specific common information repositories include the following:

Storage services such as IaaS databases, enterprise databases, and more specialized platforms such as customer relationship management (CRM) databases
Collaboration platforms such as SharePoint, Confluence, and code repositories
Messaging platforms such as Slack and Microsoft Teams

In some cases, information repositories have been improperly secured, typically by unintentionally allowing for overly-broad access by all users or even public access to unauthenticated users. This is particularly common with cloud-native or cloud-hosted services, such as AWS Relational Database Service (RDS), Redis, or ElasticSearch.^[1]^[2]^[3]

ID: T1213

Sub-techniques: T1213.001, T1213.002, T1213.003, T1213.004, T1213.005

ⓘ

Tactic: Collection

ⓘ

Platforms: IaaS, Linux, Office Suite, SaaS, Windows, macOS

Contributors: Isif Ibrahima, Mandiant; Milos Stojadinovic; Naveen Vijayaraghavan; Nilesh Dherange (Gurucul); Obsidian Security; Praetorian; Regina Elwell

Version: 3.4

Created: 18 April 2018

Last Modified: 15 April 2025

Version Permalink

Live Version

Procedure Examples

ID	Name	Description
G0007	APT28	APT28 has collected files from various information repositories.^[4]
C0040	APT41 DUST	APT41 DUST collected data from victim Oracle databases using SQLULDR2.^[5]
G0037	FIN6	FIN6 has collected schemas and user accounts from systems running SQL Server.^[6]
C0049	Leviathan Australian Intrusions	Leviathan gathered information from SQL servers and Building Management System (BMS) servers during Leviathan Australian Intrusions.^[7]
S1146	MgBot	MgBot includes a module capable of stealing content from the Tencent QQ database storing user QQ message history on infected devices.^[8]
S0598	P.A.S. Webshell	P.A.S. Webshell has the ability to list and extract data from SQL databases.^[9]
S1148	Raccoon Stealer	Raccoon Stealer gathers information from repositories associated with cryptocurrency wallets and the Telegram messaging service.^[10]
G0034	Sandworm Team	Sandworm Team exfiltrates data of interest from enterprise databases using Adminer.^[11]
G1041	Sea Turtle	Sea Turtle used the tool Adminer to remotely logon to the MySQL service of victim machines.^[12]
C0024	SolarWinds Compromise	During the SolarWinds Compromise, APT29 accessed victims' internal knowledge repositories (wikis) to view sensitive corporate information on products, services, and internal business operations.^[13]
S1196	Troll Stealer	Troll Stealer gathers information from the Government Public Key Infrastructure (GPKI) folder, associated with South Korean government public key infrastructure, on infected systems.^[14]^[15]
G0010	Turla	Turla has used a custom .NET tool to collect documents from an organization's internal central database.^[16]

Mitigations

ID	Mitigation	Description
M1047	Audit	Consider periodic review of accounts and privileges for critical and sensitive repositories. Ensure that repositories such as cloud-hosted databases are not unintentionally exposed to the public, and that security groups assigned to them permit only necessary and authorized hosts.^[17]
M1041	Encrypt Sensitive Information	Encrypt data stored at rest in databases.
M1032	Multi-factor Authentication	Use two or more pieces of evidence to authenticate to a system; such as username and password in addition to a token from a physical smart card or token generator.
M1060	Out-of-Band Communications Channel	Create plans for leveraging a secure out-of-band communications channel, rather than existing in-network chat applications, in case of a security incident.^[18]
M1054	Software Configuration	Consider implementing data retention policies to automate periodically archiving and/or deleting data that is no longer needed.
M1018	User Account Management	Enforce the principle of least-privilege. Consider implementing access control mechanisms that include both authentication and authorization.
M1017	User Training	Develop and publish policies that define acceptable information to be stored in repositories.

Detection

ID	Data Source	Data Component	Detects
DS0015	Application Log	Application Log Content	Monitor for third-party application logging, messaging, and/or other artifacts that may leverage information repositories to mine valuable information. Information repositories generally have a considerably large user base, detection of malicious use can be non-trivial. At minimum, access to information repositories performed by privileged users (for example, Active Directory Domain, Enterprise, or Schema Administrators) should be closely monitored and alerted upon, as these types of accounts should generally not be used to access information repositories. If the capability exists, it may be of value to monitor and alert on users that are retrieving and viewing a large number of documents and pages; this behavior may be indicative of programmatic means being used to retrieve all data within the repository. In environments with high-maturity, it may be possible to leverage User-Behavioral Analytics (UBA) platforms to detect and alert on user based anomalies.
DS0028	Logon Session	Logon Session Creation	Monitor for newly constructed logon behavior within Microsoft's SharePoint can be configured to report access to certain pages and documents. ^[19] Sharepoint audit logging can also be configured to report when a user shares a resource. ^[20] The user access logging within Atlassian's Confluence can also be configured to report access to certain pages and documents through AccessLogFilter. ^[21] In AWS environments, GuardDuty can be configured to report suspicious login activity in services such as RDS.^[22] Additional log storage and analysis infrastructure will likely be required for more robust detection capabilities.

Data from Information Repositories

Sub-techniques (5)

Procedure Examples

Mitigations

Detection

References