List of datasets published via the data catalogue
0 views·0 downloads
This data is aggregated from the list of datasets extracted from the site's internal API.
The list of datasets here is fully consistent with the site's data catalogue.
List of datasets published via the data catalogue
Name in Dataset | Variable | Definition |
|---|---|---|
id (String) | Dataset ID | Unique identifier for the dataset |
date_created (Date) | Date Created | Date the dataset was created, in YYYY-MM-DD format |
title_en (String) | Title (English) | Title of the dataset in English |
category_en (String) | Category (English) | Dataset's category in English |
subcategory_en (String) | Subcategory (English) | Dataset's subcategory in English |
title_bm (String) | Title (Malay) | Title of the dataset in Malay |
category_bm (String) | Category (Malay) | Dataset's category in Malay |
subcategory_bm (String) | Subcategory (Malay) | Dataset's subcategory in Malay |
source (String) | Source | Name(s) of the dataset's source agency |
frequency (String) | Data Frequency | The frequency of the data contained within the dataset |
geography (String) | Geography | Geographical level covered by the dataset |
demography (String) | Demography | Demographic breakdowns contained within the dataset |
dataset_begin (Integer) | First Year | Earliest year covered by the dataset |
dataset_end (Integer) | Last Year | Most recent year covered by the dataset |
15 Jan 2026, 00:02
16 Jan 2026, 00:10
This data is made open under the Creative Commons Attribution 4.0 International License (CC BY 4.0). A copy of the license is available Here.
Full Dataset (CSV)
Recommended for individuals seeking an Excel-friendly format.
0
Full Dataset (Parquet)
Recommended for data scientists seeking to work with data via code.
0
Connect directly to the data with Python.
# If not already installed, do: pip install pandas fastparquet
import pandas as pd
URL_DATA = 'https://storage.data.gov.my/metrics/dataset_list.parquet'
df = pd.read_parquet(URL_DATA)
if 'date' in df.columns: df['date'] = pd.to_datetime(df['date'])
print(df)The following code is an example of how to make an API query to retrieve the data catalogue mentioned above. You can use different programming languages by switching the code accordingly. For a complete guide on possible query parameters and syntax, please refer to the official Open API Documentation.
import requests
import pprint
url = "https://api.data.gov.my/data-catalogue?id=datasets&limit=3"
response_json = requests.get(url=url).json()
pprint.pprint(response_json)© 2026 Public Sector Open Data