Microsoft Graph Data Connect supports a variety of datasets, data regions, and storage locations in Microsoft Azure. This article describes the supported datasets and how to access the dataset schemas, the Microsoft 365 and Microsoft Azure regions that are supported, and the storage locations that Microsoft Graph Data Connect utilizes through Azure Synapse or Azure Data Factory.
Datasets
Microsoft Graph Data Connect currently supports the following datasets. To view the schemas for each dataset, create a new dataset in Azure Synapse or Azure Data Factory and go to the Schema tab.
Activities
Call records
Dataset name | Description | Learn more |
---|
TeamsCallRecords_v1 | Provides activity records from Teams calls and meetings. | TeamsCallRecords_v1 dataset |
Channel
Dataset name | Description | Learn more |
---|
Contact_v0 | Provides contact details available from each user's address book. | Contact_v0 dataset |
Contact_v1 | Provides the contact details available from each user's address book. | Contact_v1 dataset |
Devices and Licenses
Dataset name | Description | Learn more |
---|
OwnedDevices_v0 | Provides detailed information related to all the devices that are owned by each user in the organization. | OwnedDevices_v0 dataset |
RegisteredDevices_v0 | Provides detailed information related to all the devices that a user is registered on in the organization. | RegisteredDevices_v0 dataset |
LicenseDetails_v0 | Provides details for users' licenses that are directly assigned and those transitively assigned through memberships in licensed groups. | LicenseDetails_v0 dataset |
Event
Dataset name | Description | Learn more |
---|
CalendarView_v0 | Provides occurrences, exceptions and single instances of events, based on the calendar view from users' calendars. | CalendarView_v0 dataset |
ConferenceRoomCalendar_v0 | Provides CalendarView data of the Conference Rooms created for a tenant. | ConferenceRoomCalendar_v0 dataset |
Event_v0 | Provides all the events from users' calendars. | Event_v0 dataset |
Event_v1 | Provides all the events from users' calendars. | Event_v1 dataset |
Group
Mail
Dataset name | Description | Learn more |
---|
Message_v0 | Provides a collection of all the messages received by a user in mail folders. | Message_v0 dataset |
Message_v1 | Provides a collection of all the messages received by a user in mail folders. | Message_v1 dataset |
SentItems_v0 | Provides a collection of all the sent emails by all users of a tenant. | SentItems_v0 dataset |
SentItems_v1 | Provides a collection of all the sent emails with some additional fields. | SentItems_v1 dataset |
Mail folder
Dataset name | Description | Learn more |
---|
Inbox_v1 | Provides the messages from users' mail folders. | Inbox_v1 dataset |
Mailfolder_v0 | Provides information on all the folders created in a user's mailbox. | Mailfolder_v0 dataset |
Mailfolder_v2 | Provides the information on all mail folders created in a user's mailbox. | Mailfolder_v2 dataset |
Mailbox settings
Message
Online meetings
Dataset name | Description | Learn more |
---|
TeamsTranscript_v1 | Provides transcripts from calls and meetings in Teams when the transcript is enabled for a meeting or a call. | TeamsTranscript_v1 dataset |
Org hierarchy
Task
Dataset name | Description | Learn more |
---|
TodoTaskFolders_v0 | Identifies task folders in Microsoft Outlook that track user-level work items. | TodoTaskFolders_v0 dataset |
TodoTasks_v0 | Identifies tasks in Microsoft Outlook that track user-level work items. | TodoTasks_v0 dataset |
PlannerTasks_v0 | Identifies tasks in Planner that track user-level work items. | PlannerTasks_v0 dataset |
User
Dataset name | Description | Learn more |
---|
User_v0 | Provides user details stored for all the Microsoft Entra ID (Azure AD) user accounts that are created for a particular tenant. | User_v0 dataset |
User_v1 | Provides user details stored for all the Microsoft Entra ID (Azure AD) user accounts. | User_v1 dataset |
OneDrive and SharePoint Online
Dataset name | Description | Sample and Schema |
---|
SharePointSites_v1 | Contains information about SharePoint sites. | SharePointSites_v1 |
SharePointPermissions_v1 | Contains information about sharing permissions. | SharePointPermissions_v1 |
SharePointGroups_v1 | Contains SharePoint group information, including details about group members. | SharePointGroups_v1 |
SharePointFiles_v1 | Contains information about SharePoint files. | SharePointFiles_v1 |
SharePointFileActions_v1 | Contains information about SharePoint file actions. | SharePointFileActions_v1 |
OneDriveSyncHealth_v1 | Contains information about devices running OneDrive for work or school. | OneDriveSyncHealth_v1 |
OneDriveSyncErrors_v1 | Contains details about errors on devices running OneDrive for work or school. | OneDriveSyncErrors_v1 |
Viva Insights
Dataset name | Description | Sample and Schema | License |
---|
VivaInsightsDataset_Report_v1_{Viva_Insights_Query_Name} | Contains metrics according to the query authored by the user in Viva Insights. | Varies per report. | Requires Viva Insights license. |
Note:{Viva_Insights_Query_Name}
represents a placeholder for the Viva Insights query name that, when combined with VivaInsightsDataset_Report_v1_, forms the dataset name.
Regions
Microsoft Graph Data Connect supports extracting data from a variety of Microsoft 365 regions. To successfully move data from the Microsoft 365 data center into your Microsoft Azure storage, the Azure Synapse or Azure Data Factory instance and the Azure storage location must both map to a supported region for the location of the Microsoft 365 data.
The following table indicates which Microsoft 365 regions are supported and the corresponding Azure regions required for data movement.
Office region | Azure region |
---|
Asia-Pacific | |
Australia | - Australia East
- Australia Southeast
|
Europe | |
North America | - Central US
- East US
- East US 2
- North Central US
- South Central US
- West Central US
- West US
- West US 2
|
Brazil | |
United Kingdom | |
Canada (CAN) | - Canada Central
- Canada East
|
Japan (JPN) | |
India (IND) | |
Korea (KOR) | |
Switzerland (CHE) | |
Germany (DEU) | |
Norway (NOR) | |
France (FRA) | |
UAE (UAE) | |
Sinks
Sinks are the output location that Azure Synapse or Azure Data Factory uses to place data in Azure storage. Microsoft Graph Data Connect supports the following sink storage types:
The following characteristics apply to sinks:
Service Principal authentication is the only supported authentication mechanism for all sink types in a copy activity with Microsoft 365 as the source.
When using Azure Storage Blob as the sink, you must ensure that your application has Storage Blob Data Contributor access to the Azure Storage Blob location.
For copy activity, the output files are formatted as JSON. This format is fixed and modifying the format isn't supported. However, you can use Azure Synapse or Azure Data Factory to copy the result of a Microsoft Graph Data Connect pipeline into another storage mechanism (such as Azure SQL Database).
Mapping data flows: Copy and transform data from Microsoft 365 (Office 365) - Azure Data Factory & Azure Synapse | Microsoft Learn |
Output can be in parquet format. For details about the supported data transformations, see Flatten transformation in mapping data flow.
Microsoft Graph Data Connect on mapping data flows supports direct output of the data into Azure SQL DB.
The following table indicates the areas that are supported for the corresponding copy activity and mapping data flows.
Area | Copy activity | Mapping data flows |
---|
Output data formats supported | JSON | JSON, Parquet |
Data transformation (normalization/flattening/etc.) | Requires additional transformation step in the ADF/Synapse pipeline | Supports inline transformations |
Supported data sinks | ADLS gen2, Azure Blob | ADLS gen2, Azure Blob, Azure SQL DB |
Azure VNET IR | Not supported | Supported |
Related content