Release date and metadata for all DOSM's publications in 2024.
0 viewsΒ·0 downloads
DOSM's advance release calendar is compiled and coordinated by the Strategic Communications Division for the purpose of display on the main DOSM portal. It contains three fields, namely the title in English, title in Malay, and release date. To produce the structured dataset shown here, the OpenDOSM team conducts standardisation of naming conventions and adds several metadata fields which are typically helpful for users, such as category, frequency, and release series (in date and natural language formats).
β
The data shown here tallies exactly with the Upcoming Publications calendar on OpenDOSM.
Release date and metadata for all DOSM's publications in 2024.
Name in Dataset | Variable | Definition |
---|---|---|
publication_id (String) | Publication ID | Unique ID identifying the publication; this should be the primary key if used in a database. |
release_date (Timestamp) | Release Date | Date and time of release in YYYY-MM-DD HH:MM:SS format. The publication is embargoed until such time. |
title_en (String) | Publication Title (EN) | Title of the publication in English, in the format {title}: {data up to} |
title_bm (String) | Title Publication (MS) | Title of the publication in Malay, in the format {title}: {data up to} |
publication_type (String) | Publication Category | Category of the publication indicating its broad domain. For example, all publications relating to GDP and satellite accounts are categorised as 'gdp'. |
publication_type_en (String) | Publication Category (EN) | Category of the publication in English |
publication_type_bm (String) | Publication Category (MS) | Category of the publication in Malay |
frequency (Categorical) | Frequency of data | Either YEARLY, QUARTERLY, or MONTHLY. Note that this is the frequency of the data series within the publication, NOT the frequency of the publication itself. |
release_series (Date) | Series | Data as of this date, in YYYY-MM-DD format. The earliest date in the period is chosen, e.g. YYYY-01-01 for annual data and YYYY-04-01 for 2nd quarter data. |
release_series_date (String) | Series (Formatted Date) | The 'release_series' field formatted according to frequency. For instance, 2023-01-01 is formatted as 2023 for yearly data, 2023-q1 for quarterly data, and 2023-01 for monthly data. |
release_series_en (String) | Series (EN) | The 'release_series' field formatted as natural language in English. For instance, 2023-01-01 is formatted as 2023 for yearly data, Q1 2023 for quarterly data, and Jan 2023 for monthly data. |
release_series_bm (String) | Series (MS) | The 'release_series' field formatted as natural language in Malay. For instance, 2023-01-01 is formatted as 2023 for yearly data, ST1 2023 for quarterly data, and Jan 2023 for monthly data. |
28 Nov 2024, 08:00
29 Nov 2024, 08:00
This data is made open under the Creative Commons Attribution 4.0 International License (CC BY 4.0). A copy of the license is available Here.
Full Dataset (CSV)
Recommended for individuals seeking an Excel-friendly format.
0
Full Dataset (Parquet)
Recommended for data scientists seeking to work with data via code.
0
Connect directly to the data with Python.
# If not already installed, do: pip install pandas fastparquet
import pandas as pd
URL_DATA = 'https://storage.dosm.gov.my/meta/arc.parquet'
df = pd.read_parquet(URL_DATA)
if 'date' in df.columns: df['date'] = pd.to_datetime(df['date'])
print(df)
The following code is an example of how to make an API query to retrieve the data catalogue mentioned above. You can use different programming languages by switching the code accordingly. For a complete guide on possible query parameters and syntax, please refer to the official Open API Documentation.
import requests
import pprint
url = "https://api.data.gov.my/data-catalogue?id=arc_dosm&limit=3"
response_json = requests.get(url=url).json()
pprint.pprint(response_json)
Β© 2024 Public Sector Open Data