Hourly origin-destination ridership data for the Komuter service. The table provides a brief preview of the data, but you may download the data in full.
This dataset is derived from anonymous transactional data generated by the KTMB Integrated Ticketing System (KITS). The origin-destination totals are calculated by summing passenger counts across all station-station pairs at hourly frequency. This data represents the most 'raw' form of the data provided by KTMB to support national open data, and is the single source of truth from which all other Komuter ridership datasets are derived.
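
The aggregation described above can be sketched as follows. This is a minimal illustration, not KTMB's actual pipeline: the transaction table, its column names (`timestamp`, `passengers`) and the station labels are assumptions made up for the example, since the underlying KITS schema is not described on this page.

```python
import pandas as pd

# Hypothetical transaction records; column names and values are illustrative only.
transactions = pd.DataFrame({
    "timestamp": pd.to_datetime([
        "2025-01-01 09:05", "2025-01-01 09:40", "2025-01-01 09:59",
        "2025-01-01 10:00", "2025-01-01 10:15",
    ]),
    "origin": ["KL Sentral", "KL Sentral", "Klang", "KL Sentral", "Klang"],
    "destination": ["Klang", "Klang", "KL Sentral", "Klang", "KL Sentral"],
    "passengers": [1, 2, 1, 1, 3],
})

# Sum passenger counts for every (date, hour, origin, destination) combination.
od_hourly = (
    transactions
    .assign(
        date=transactions["timestamp"].dt.date,
        time=transactions["timestamp"].dt.strftime("%H:00"),  # keep the hour, set MM to 00
    )
    .groupby(["date", "time", "origin", "destination"], as_index=False)["passengers"]
    .sum()
    .rename(columns={"passengers": "ridership"})
)
print(od_hourly)
```

The result has one row per date, hour and station pair, which matches the column layout documented in the table further down.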
Because the size of the complete dataset from 2023 onwards exceeds the row limit of Microsoft Excel (1,048,576), we recommend working with the data programmatically, preferably using the Parquet files provided.
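
As a rough way to check whether a given extract would still open fully in Excel, the sketch below compares a row count against the worksheet limit quoted above. The helper name `fits_in_excel` is hypothetical, and the example assumes a single header row.

```python
EXCEL_MAX_ROWS = 1_048_576  # worksheet row limit quoted above

def fits_in_excel(n_data_rows: int, header_rows: int = 1) -> bool:
    """Return True if the data rows plus header rows fit in one worksheet."""
    return n_data_rows + header_rows <= EXCEL_MAX_ROWS

print(fits_in_excel(500_000))    # a small extract fits
print(fits_in_excel(1_048_576))  # the header row pushes this past the limit
```

With a loaded DataFrame `df`, the same check would be `fits_in_excel(len(df))`.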
Our site features a dashboard on KTMB ridership where you can explore this data in a more interactive manner.
Name in Dataset | Variable | Definition |
---|---|---|
date (Date) | Date | Date in YYYY-MM-DD format |
time (String) | Time | Hour in HH:MM format, with MM set to 00. The 'time' column was produced by taking the hour from the timestamp of the transaction. As an example, a value of '09:00' indicates all transactions occurring between 09:00 and 09:59. This is different from rounding to the nearest hour, where a value of '09:00' would indicate all transactions occurring between 08:30 and 09:29. |
origin (String) | Origin | The station at which the journey began |
destination (String) | Destination | The station at which the journey ended |
ridership (Integer) | Ridership | Number of trips taken from the origin to the destination; it should be noted that this may not necessarily equal the number of passengers, especially for origin-destination pairs with a short travel time. However, because the data is at hourly frequency, the number of passengers is likely to be close or equal to the number of trips. |
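
The difference between taking the hour and rounding to the nearest hour, as described for the 'time' column, can be illustrated with pandas. The timestamps below are made up for the example; only the flooring behaviour corresponds to how the column is documented.

```python
import pandas as pd

# Illustrative transaction timestamps (not real data).
stamps = pd.Series(pd.to_datetime([
    "2025-01-01 08:45",
    "2025-01-01 09:00",
    "2025-01-01 09:29",
    "2025-01-01 09:59",
]))

# Taking the hour (flooring): this is the behaviour documented for 'time'.
floored = stamps.dt.floor("h").dt.strftime("%H:%M")

# Rounding to the nearest hour: shown only for contrast.
rounded = stamps.dt.round("h").dt.strftime("%H:%M")

print(pd.DataFrame({"timestamp": stamps, "floored": floored, "rounded": rounded}))
```

Under flooring, 08:45 falls in the '08:00' bucket and 09:59 in the '09:00' bucket; under rounding they would move to '09:00' and '10:00' respectively.
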
31 Mar 2025, 03:31
01 Apr 2025, 03:45
This data is made open under the Creative Commons Attribution 4.0 International License (CC BY 4.0). A copy of the license is available here.
Full Dataset (CSV)
Recommended for individuals seeking an Excel-friendly format.
Full Dataset (Parquet)
Recommended for data scientists seeking to work with data via code.
Connect directly to the data with Python.
```python
# If not already installed, do: pip install pandas fastparquet
import pandas as pd

URL_DATA = 'https://storage.data.gov.my/transportation/ktmb/komuter_2025.parquet'

# Read the Parquet file directly from the URL
df = pd.read_parquet(URL_DATA)

# Parse the 'date' column as datetime if it is present
if 'date' in df.columns:
    df['date'] = pd.to_datetime(df['date'])

print(df)
```
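
Once loaded, the hourly origin-destination table can be aggregated into coarser views. The sketch below uses a small made-up sample with the documented columns so that it runs without a network connection; the station labels `A` and `B` are placeholders, and the same calls should apply to the real `df` under the assumption that it follows the column definitions above.

```python
import pandas as pd

# Made-up sample following the documented columns; values are illustrative only.
sample = pd.DataFrame({
    "date": pd.to_datetime(["2025-01-01", "2025-01-01", "2025-01-01", "2025-01-02"]),
    "time": ["08:00", "08:00", "17:00", "08:00"],
    "origin": ["A", "B", "A", "A"],
    "destination": ["B", "A", "B", "B"],
    "ridership": [10, 4, 7, 12],
})

# Total trips per day across all station pairs.
daily_total = sample.groupby("date")["ridership"].sum()

# Origin-destination matrix: trips summed over all dates and hours.
od_matrix = sample.pivot_table(
    index="origin",
    columns="destination",
    values="ridership",
    aggfunc="sum",
    fill_value=0,
)

print(daily_total)
print(od_matrix)
```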
This data catalog is not available through OpenAPI as the nature of the data makes it unsuitable for API access. For the full dataset, please use the provided download link as shown in the above section.
© 2025 Public Sector Open Data