Monthly Air Pollution

Data as of 31 Dec 2022, 23:59

Average monthly concentration of key air pollutants.

0 views·0 downloads

Table

Concentration of Air Pollutants

How is this data produced?

This dataset presents monthly concentrations of various air pollutants, including carbon monoxide (CO), nitrogen dioxide (NO²), ozone (O³), particulate matter (PM₁₀, PM₂.₅), and sulfur dioxide (SO²). The data is tabulated from hourly observational data collected from monitoring stations across Malaysia.

What caveats I should bear in mind when using this data?

A small minority of values in 2017 are null due to inavailability of data.

Publication(s) using this data

Metadata

Dataset description

Average monthly concentration of key air pollutants.

Variable definitions
  • Date
  • Pollutant
  • Concentration
Last updated:

27 Nov 2023, 12:00

Next update:

27 Nov 2024, 12:00

Data source(s)
  • JAS
  • DOSM
License

This data is made open under the Creative Commons Attribution 4.0 International License (CC BY 4.0). A copy of the license is available Here.

Download

Data
Full Dataset (CSV)

Full Dataset (CSV)

Recommended for individuals seeking an Excel-friendly format.

0

Full Dataset (Parquet)

Full Dataset (Parquet)

Recommended for data scientists seeking to work with data via code.

0

Code

Connect directly to the data with Python.

# If not already installed, do: pip install pandas fastparquet import pandas as pd URL_DATA = 'https://storage.data.gov.my/environment/air_pollution.parquet' df = pd.read_parquet(URL_DATA) if 'date' in df.columns: df['date'] = pd.to_datetime(df['date']) print(df)

Sample OpenAPI query

The following code is an example of how to make an API query to retrieve the data catalogue mentioned above. You can use different programming languages by switching the code accordingly. For a complete guide on possible query parameters and syntax, please refer to the official Open API Documentation.

import requests import pprint url = "https://api.data.gov.my/data-catalogue?id=air_pollution&limit=3" response_json = requests.get(url=url).json() pprint.pprint(response_json)