Idea Options
- Subscribe to RSS Feed
- Mark as New
- Mark as Read
- Bookmark
- Subscribe
- Printer Friendly Page
- Report Inappropriate Content
Honor ETag or Cache-Control to reduce OData network traffic by an order of magnitude, including developer experience
Submitted by
guido_leenders
on
11-04-2021
03:05 AM
In this idea, I will explain how Finance and Data Professionals can experience a significant improvement in their productivity when Power BI Desktop would honor the ETag or Cache-Control headers of OData feeds. The idea also explains how the quantity of avoidable network traffic for OData can be reduced by up to 1% when used with Power BI Desktop. On an OData Producer over 20,000 companies access their cloud sources (typically European cloud apps such as Twinfield or Exact Online) as an OData feed. Each API on the cloud platforms becomes an entity set ultimately; interim step is a table in our SQL engine. Over 90% users are finance and data professionals relatively new to Power BI. They often stick to Power BI Desktop for quite some time. The technology stack of Power BI, M, Invantive SQL and cloud paltforms allows filters to be automatically rewritten and forwarded across the whole chain up-to into the ultimate cloud platforms connected to. Problem is that during developing and testing the reports, Power BI Desktop continuously retrieves the same OData4 data set again and again. Using caching on the OData Producer, subsequent request for a data set are sent back instantly. However, the data sets are typically large, even up to over 1 GB gzip-compressed network size and over 10 GB uncompressed due to the large amount of financial transactions and number of companies contained. We find that during normal use 75% of the OData requests can be answered by the OData response cache, and during development of reports up to 99% of the requests are answered from the OData response cache by the OData producer. The OData answers include ETag and Cache-Control, but their values are ignored by Power BI Desktop. Google Chrome (the vi-variant of Power BI Desktop :-)) honors them correctly and this extremely reduces network traffic. So, a large amount of avoidable network traffic is initiated when using OData feeds. Of course experienced users vertically filter the data (for instance by company or year) and make the development process run faster. But our platform focuses on Finance and Data Professionals, not on IT-professionals. This audience has typically less experience and is less well-trained to act within some arbitrary technical boundaries. It would improve performance for developers by an order of magnitude when Power BI Desktop would honor the ETag and Cache-Control. Downloading a 200 MB dataset typically takes a few minutes due to network bandwidth restrictions, plus some processing. For more background see https://forums.invantive.com/t/sneller-data-verwerken-uit-invantive-cloud-op-geselecteerde-odata-clients/1706. Summarizing: when Power BI Desktop would honor the ETag or Cache-Control headers on OData feeds, inexperienced users would experience a significant improvement in their productivity, while network traffic drops to as little as 1%.
See more ideas labeled with:
Comments
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
Latest Comments
-
Aala_Ali
on: Importing data (or drag and drop) from Fabric Lake...
- anic on: Enhancing Purview Glossary Integration with Power ...
- yeyu47 on: Deployment Pipeline roles
- giusepper11 on: Reintroduce Workspace Name visibility for Lakehous...
-
michaelu1
on: Scheduled refreshes were turned off after two mont...
-
anshulsharma on: Integrate Fabric Eventhouse with Azure AI Agent se...
- tom_vanleijsen on: Hide "updating" spinners in real-time dashboards
-
kleigh
on: change button slicer selected item color
- SimonKAKI on: OneLake Cross-Region Mirroring
-
jovanpop-msft on: Add native OPENROWSET(json) support in Fabric DW
Idea Statuses
- New 15,034
- Need Clarification 5
- Needs Votes 22,636
- Under Review 640
- Planned 267
- Completed 1,649
- Declined 221
-
Power BI
38,763 -
Fabric platform
537 -
Data Factory
445 -
Data Factory | Data Pipeline
290 -
Data Engineering
265 -
Data Warehouse
186 -
Data Factory | Dataflow
154 -
Real-Time Intelligence
128 -
Fabric platform | Workspaces
122 -
Fabric platform | OneLake
119 -
Fabric platform | Admin
114 -
Fabric platform | CICD
89 -
Fabric platform | Capacities
66 -
Real-Time Intelligence | Eventhouse and KQL
61 -
Real-Time Intelligence | Activator
53 -
Fabric platform | Governance
51 -
Fabric platform | Security
48 -
Data Science
47 -
Data Factory | Mirroring
37 -
Databases | SQL Database
31 -
Fabric platform | Support
31 -
Real-Time Intelligence | Eventstream
29 -
Fabric platform | Data hub
28 -
Databases
22 -
Data Factory | Apache Airflow Job
3 -
Fabric platform | Real-Time hub
3 -
Product
2 -
Real-Time Hub
1