title | description | author | ms.author | ms.localizationpriority | ms.subservice | ms.date |
---|---|---|---|---|---|---|
Datasets, regions, and sinks supported by Microsoft Graph Data Connect | Learn about the supported datasets, Microsoft 365 regions, and sink storage types that you can use with Microsoft Graph Data Connect. | michaelvenables | rimisra | high | data-connect | 11/07/2024 |
Microsoft Graph Data Connect supports a variety of datasets, data regions, and storage locations in Microsoft Azure. This article describes the supported datasets and how to access the dataset schemas, the Microsoft 365 and Microsoft Azure regions that are supported, and the storage locations that Microsoft Graph Data Connect utilizes through Azure Synapse or Azure Data Factory.
Microsoft Graph Data Connect currently supports the following datasets. To view the schemas for each dataset, create a new dataset in Azure Synapse or Azure Data Factory and go to the Schema tab.
Dataset name | Description | Learn more |
---|---|---|
OutlookContactActivity_v0 | Provides employees' activity with their contacts in Microsoft Outlook. | OutlookContactActivity_v0 dataset |
OutlookMailActivity_v0 | Provides employees' activity with their email in Outlook. | OutlookMailActivity_v0 dataset |
OutlookMeetingActivity_v0 | Provides employees' activity with their meetings in Outlook. | OutlookMeetingActivity_v0 dataset |
TeamsChannelActivity_v0 | Providesemployees' activity with their channels in Microsoft Teams. | TeamsChannelActivity_v0 dataset |
TeamsConversationActivity_v0 | Provides employees' activity with their teams and chats in Teams. | TeamsConversationActivity_v0 dataset |
Dataset name | Description | Learn more |
---|---|---|
TeamsCallRecords_v1 | Provides activity records from Teams calls and meetings. | TeamsCallRecords_v1 dataset |
Dataset name | Description | Learn more |
---|---|---|
TeamsChannelDetails_v0 | Generates a list of Microsoft Teams channels. | TeamsChannelDetails_v0 dataset |
Dataset name | Description | Learn more |
---|---|---|
Contact_v0 | Provides contact details available from each user's address book. | Contact_v0 dataset |
Contact_v1 | Provides the contact details available from each user's address book. | Contact_v1 dataset |
Dataset name | Description | Learn more |
---|---|---|
OwnedDevices_v0 | Provides detailed information related to all the devices that are owned by each user in the organization. | OwnedDevices_v0 dataset |
RegisteredDevices_v0 | Provides detailed information related to all the devices that a user is registered on in the organization. | RegisteredDevices_v0 dataset |
LicenseDetails_v0 | Provides details for users' licenses that are directly assigned and those transitively assigned through memberships in licensed groups. | LicenseDetails_v0 dataset |
Dataset name | Description | Learn more |
---|---|---|
CalendarView_v0 | Provides occurrences, exceptions and single instances of events, based on the calendar view from users' calendars. | CalendarView_v0 dataset |
ConferenceRoomCalendar_v0 | Provides CalendarView data of the Conference Rooms created for a tenant. | ConferenceRoomCalendar_v0 dataset |
Event_v0 | Provides all the events from users' calendars. | Event_v0 dataset |
Event_v1 | Provides all the events from users' calendars. | Event_v1 dataset |
Dataset name | Description | Learn more |
---|---|---|
GroupDetails_v0 | Provides the Microsoft Entra ID (Azure AD) groups data for a tenant. | GroupDetails_v0 dataset |
GroupMembers_v0 | Generates a list of direct members of all groups. | GroupMembers_v0 dataset |
GroupOwners_v0 | Retrieves the list of all the group owners. | GroupOwners_v0 dataset |
Dataset name | Description | Learn more |
---|---|---|
Message_v0 | Provides a collection of all the messages received by a user in mail folders. | Message_v0 dataset |
Message_v1 | Provides a collection of all the messages received by a user in mail folders. | Message_v1 dataset |
SentItems_v0 | Provides a collection of all the sent emails by all users of a tenant. | SentItems_v0 dataset |
SentItems_v1 | Provides a collection of all the sent emails with some additional fields. | SentItems_v1 dataset |
Dataset name | Description | Learn more |
---|---|---|
Inbox_v1 | Provides the messages from users' mail folders. | Inbox_v1 dataset |
Mailfolder_v0 | Provides information on all the folders created in a user's mailbox. | Mailfolder_v0 dataset |
Mailfolder_v2 | Provides the information on all mail folders created in a user's mailbox. | Mailfolder_v2 dataset |
Dataset name | Description | Learn more |
---|---|---|
MailboxSettings_v0 | Provides details of all users' mailbox settings. | MailboxSettings_v0 dataset |
Dataset name | Description | Learn more |
---|---|---|
OutlookGroupConversations_v0 | Provides a collection of group conversations between users of tenant. | OutlookGroupConversations_v0 dataset |
TeamChat_v1 | Provides Teams chat messages for one-on-one and group chat messages. | TeamChat_v1 dataset |
TeamChat_v2 | Provides Teams chat messages for one-on-one and group chat messages. | TeamChat_v2 dataset |
TeamsStandardChannelMessages_v0 | Provides channel posts and messages from standard channels in Teams. | TeamsStandardChannelMessages_v0 dataset |
Dataset name | Description | Learn more |
---|---|---|
TeamsTranscript_v1 | Provides transcripts from calls and meetings in Teams when the transcript is enabled for a meeting or a call. | TeamsTranscript_v1 dataset |
Dataset name | Description | Learn more |
---|---|---|
DirectReport_v0 | Provides details of all the direct reports for your users. | DirectReport_v0 dataset |
Manager_v0 | Provides a list of users assigned as managers. | Manager_v0 dataset |
Dataset name | Description | Learn more |
---|---|---|
TodoTaskFolders_v0 | Identifies task folders in Microsoft Outlook that track user-level work items. | TodoTaskFolders_v0 dataset |
TodoTasks_v0 | Identifies tasks in Microsoft Outlook that track user-level work items. | TodoTasks_v0 dataset |
PlannerTasks_v0 | Identifies tasks in Planner that track user-level work items. | PlannerTasks_v0 dataset |
Dataset name | Description | Learn more |
---|---|---|
User_v0 | Provides user details stored for all the Microsoft Entra ID (Azure AD) user accounts that are created for a particular tenant. | User_v0 dataset |
User_v1 | Provides user details stored for all the Microsoft Entra ID (Azure AD) user accounts. | User_v1 dataset |
Dataset name | Description | Sample and Schema |
---|---|---|
SharePointSites_v1 | Contains information about SharePoint sites. | SharePointSites_v1 |
SharePointPermissions_v1 | Contains information about sharing permissions. | SharePointPermissions_v1 |
SharePointGroups_v1 | Contains SharePoint group information, including details about group members. | SharePointGroups_v1 |
SharePointFiles_v1 | Contains information about SharePoint files. | SharePointFiles_v1 |
SharePointFileActions_v1 | Contains information about SharePoint file actions. | SharePointFileActions_v1 |
OneDriveSyncHealth_v1 | Contains information about devices running OneDrive for work or school. | OneDriveSyncHealth_v1 |
OneDriveSyncErrors_v1 | Contains details about errors on devices running OneDrive for work or school. | OneDriveSyncErrors_v1 |
Dataset name | Description | Sample and Schema | License |
---|---|---|---|
VivaInsightsDataset_Report_v1_{Viva_Insights_Query_Name} | Contains metrics according to the query authored by the user in Viva Insights. | Varies per report. | Requires Viva Insights license. |
Note:
{Viva_Insights_Query_Name}
represents a placeholder for the Viva Insights query name that, when combined with VivaInsightsDataset_Report_v1_, forms the dataset name.
Microsoft Graph Data Connect supports extracting data from a variety of Microsoft 365 regions. To successfully move data from the Microsoft 365 data center into your Microsoft Azure storage, the Azure Synapse or Azure Data Factory instance and the Azure storage location must both map to a supported region for the location of the Microsoft 365 data.
The following table indicates which Microsoft 365 regions are supported and the corresponding Azure regions required for data movement.
Office region | Azure region |
---|---|
Asia-Pacific |
|
Australia |
|
Europe |
|
North America |
|
Brazil |
|
United Kingdom |
|
Canada (CAN) |
|
Japan (JPN) |
|
India (IND) |
|
Korea (KOR) |
|
Switzerland (CHE) |
|
Germany (DEU) |
|
Norway (NOR) |
|
France (FRA) |
|
UAE (UAE) |
|
Sinks are the output location that Azure Synapse or Azure Data Factory uses to place data in Azure storage. Microsoft Graph Data Connect supports the following sink storage types:
- Azure Data Lake Storage Gen2
- Azure Storage Blob
- Azure SQL DB (mapping data flows only)
- Microsoft Fabric OneLake
The following characteristics apply to sinks:
Service Principal authentication is the only supported authentication mechanism for all sink types in a copy activity with Microsoft 365 as the source.
When using Azure Storage Blob as the sink, you must ensure that your application has Storage Blob Data Contributor access to the Azure Storage Blob location.
For copy activity, the output files are formatted as JSON. This format is fixed and modifying the format isn't supported. However, you can use Azure Synapse or Azure Data Factory to copy the result of a Microsoft Graph Data Connect pipeline into another storage mechanism (such as Azure SQL Database).
Mapping data flows: Copy and transform data from Microsoft 365 (Office 365) - Azure Data Factory & Azure Synapse | Microsoft Learn |
Output can be in parquet format. For details about the supported data transformations, see Flatten transformation in mapping data flow.
Microsoft Graph Data Connect on mapping data flows supports direct output of the data into Azure SQL DB.
The following table indicates the areas that are supported for the corresponding copy activity and mapping data flows.
Area | Copy activity | Mapping data flows |
---|---|---|
Output data formats supported | JSON | JSON, Parquet |
Data transformation (normalization/flattening/etc.) | Requires additional transformation step in the ADF/Synapse pipeline | Supports inline transformations |
Supported data sinks | ADLS gen2, Azure Blob | ADLS gen2, Azure Blob, Azure SQL DB |
Azure VNET IR | Not supported | Supported |