Daily origin-destination ridership data covering the entire Rapid Rail network in the Klang Valley. The table provides a brief preview of the data, but you may download the data in full.
0 views·0 downloads
This dataset is derived from anonymous transactional data generated by Rapid's tap-in-tap-out system, which governs the gates at all rail stations in the Klang Valley. The origin-destination totals are calculated by summing passenger counts across all station-station pairs at daily frequency.
Because the size of the complete dataset for each year since 2023 exceeds the row limit of Microsoft Excel (1,048,576), we recommend working with the data programatically, preferably using the parquet files provided.
Our site features a dashboard on Rapid Rail ridership where you can explore this data in a more interactive manner.
Daily origin-destination ridership data covering the entire Rapid Rail network in the Klang Valley. The table provides a brief preview of the data, but you may download the data in full.
Name in Dataset | Variable | Definition |
|---|---|---|
date (Date) | Date | Date in YYYY-MM-DD format |
origin (String) | Origin | The station at which the journey began |
destination (String) | Destination | The station at which the journey ended |
ridership (Integer) | Ridership | Number of trips taken from the origin to the destination; it should be noted that this may not necessarily equal the number of passengers, because the same passenger may make multiple trips on that route within the same day. |
03 Apr 2026, 17:51
04 Apr 2026, 22:00
This data is made open under the Creative Commons Attribution 4.0 International License (CC BY 4.0). A copy of the license is available Here.
Full Dataset (CSV)
Recommended for individuals seeking an Excel-friendly format.
0
Full Dataset (Parquet)
Recommended for data scientists seeking to work with data via code.
0
Connect directly to the data with Python.
# If not already installed, do: pip install pandas fastparquet
import pandas as pd
URL_DATA = 'https://storage.data.gov.my/transportation/rail/rapidrail_2026_daily.parquet'
df = pd.read_parquet(URL_DATA)
if 'date' in df.columns: df['date'] = pd.to_datetime(df['date'])
print(df)This data catalog is not available through OpenAPI as the nature of the data makes it unsuitable for API access. For the full dataset, please use the provided download link as shown in the above section.
© 2026 Public Sector Open Data