Are you using our SaaS platform (Baserow.io) or self-hosting Baserow?
SaaS
What are the exact steps to reproduce this issue?
I’m developing an analytics dashboard using Python and Streamlit that connects to our Baserow SaaS instance (baserow.io) via the REST API.
My “Processed Sales Data” table has grown to over 7,000 rows, and I’m noticing that the initial data fetch is becoming quite slow. My current approach is to fetch all rows from the table at application startup to populate the dashboard.
Attach screenshots, videos, or logs that demonstrate the issue.
If I understand correctly, you are fetching all the data from Baserow every time you want to run your analytics?
In that case it will always become slow at some point (it’s only a matter of when).
I don’t know what analytics you are doing with that data, but what about creating a separate table within the workspace and using the Baserow formula system to aggregate/normalize the data before fetching? This way you might either get everything done within Baserow (without scripts) or reduce the amount of data your scripts need to fetch.
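To make the "reduce the amount of data" idea concrete, here is a minimal sketch of a paginated fetch that asks only for the columns the dashboard needs. It assumes the standard list-rows endpoint with the `size`, `include`, and `user_field_names` query parameters and a response carrying `results` and `next`; the table id, field names, and helper names are hypothetical, so check the API docs generated for your own table before relying on it.

```python
import json
import urllib.parse
import urllib.request
from typing import Callable, Dict, Iterator, List

# Assumed base URL of the list-rows endpoint on the SaaS instance.
API_ROOT = "https://api.baserow.io/api/database/rows/table"


def build_first_page_url(table_id: int, fields: List[str], page_size: int = 200) -> str:
    """Build the URL of the first page, requesting only the listed fields."""
    query = urllib.parse.urlencode({
        "user_field_names": "true",   # refer to fields by name instead of field_<id>
        "size": page_size,            # rows per page; 200 is assumed to be the maximum
        "include": ",".join(fields),  # skip columns the dashboard never reads
    })
    return f"{API_ROOT}/{table_id}/?{query}"


def get_json(url: str, token: str) -> Dict:
    """Fetch one page using a database token (hypothetical helper)."""
    request = urllib.request.Request(url, headers={"Authorization": f"Token {token}"})
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def iter_rows(first_url: str, get_page: Callable[[str], Dict]) -> Iterator[Dict]:
    """Yield rows page by page, following the `next` link until it is empty."""
    url = first_url
    while url:
        page = get_page(url)
        yield from page["results"]
        url = page.get("next")
```

A call could look like `rows = list(iter_rows(build_first_page_url(12345, ["Order Date", "Amount"]), lambda u: get_json(u, TOKEN)))`, where `12345`, the field names, and `TOKEN` are placeholders. At 200 rows per page, 7,000 rows still take 35 requests, so trimming columns helps with payload size rather than request count.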
Yes, you’re right — my whole backend relies on fetching data from Baserow, manipulating and transforming it using Python scripts. I also perform several operations to save the transformed data back into Baserow. For compatibility, all columns in Baserow are set to text (except for a few file-type columns).
It works smoothly when the row count is around ~1K, but when the table grows to 5K–7K rows, the fetching becomes significantly slower — especially compared to Google Sheets API or databases like MySQL. I even tried enabling indexing in Baserow, but didn’t notice any performance improvement.
I was wondering if there’s any API endpoint or method in Baserow that allows faster fetching of large data sets?
Also, SQL query support would be super helpful — though in my use case, I mostly rely on the last 3 months of data, so not sure how much it would help overall.
If there’s any alternative approach or optimization, I’d really appreciate the guidance. Thanks again for the support!
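One optimization that does not depend on a faster endpoint is to stop repeating the full fetch: keep the rows in memory and refetch only once they are older than some limit. The sketch below illustrates that idea in plain Python; `CachedRows`, its parameters, and the ten-minute limit in the usage note are hypothetical. In a Streamlit app, the built-in `st.cache_data(ttl=...)` decorator plays the same role.

```python
import time
from typing import Callable, Dict, List, Optional


class CachedRows:
    """Keep the last fetched rows in memory and refetch only when they are stale."""

    def __init__(
        self,
        fetch: Callable[[], List[Dict]],
        max_age_seconds: float,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._fetch = fetch              # the slow call that downloads all rows
        self._max_age = max_age_seconds  # how long a cached copy stays valid
        self._clock = clock              # injectable so the behaviour can be tested
        self._rows: Optional[List[Dict]] = None
        self._fetched_at = 0.0

    def get(self) -> List[Dict]:
        """Return cached rows, calling `fetch` only if nothing is cached or it expired."""
        now = self._clock()
        if self._rows is None or now - self._fetched_at >= self._max_age:
            self._rows = self._fetch()
            self._fetched_at = now
        return self._rows
```

With `cache = CachedRows(fetch_all_rows, max_age_seconds=600)`, where `fetch_all_rows` stands for whatever function currently downloads the table, repeated `cache.get()` calls within ten minutes reuse the same rows. The trade-off is that the dashboard can show data up to ten minutes old.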
The other approach mentioned is to aggregate/normalize the data within Baserow. Whether that works really depends on which operations you need to perform on your data that you can’t do through Baserow itself, i.e. the formula system.