State-level Household Income by Percentile

Data as of 2022

Percentile-resolution household income data at state level.

Household Income Distribution

How is this data produced?

This data is based on the Household Income Surveys (HIS) and Household Income & Expenditure Surveys (HIES) carried out from 1970 to 2022. The survey is carried out at least twice in any rolling five-year period to produce representative data on income, poverty and access to basic amenities for Malaysian households. In particular, this dataset provides deeper insight into the distribution of household income at state level by tabulating key household income metrics by percentile (100 equally sized groups), rather than by the typically published decile (10 groups) or quartile (4 groups). The dataset provides 4 metrics on monthly household income:

  • Mean for percentile K, which is the average among households within percentile K only.
  • Median for percentile K, which is the median among households in percentile K only. An additional implication is that (K - 0.5)% of households have monthly household income less than or equal to this amount. For example, 12.5% of households have monthly household income less than or equal to the median within the 13th percentile.
  • Minimum income for percentile K, which is the minimum among households in percentile K. All households within percentile K have income greater than or equal to this amount.
  • Maximum income for percentile K, which is the maximum among households in percentile K. No households within percentile K have income exceeding this amount.
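
The four metrics above can be illustrated with a minimal sketch. It assumes an unweighted list of household incomes whose length is a multiple of 100; the actual survey estimates rely on sampling weights and the official methodology, which this sketch does not reproduce. The function name `percentile_metrics` and the synthetic incomes are hypothetical and serve only as illustration.

```python
import statistics


def percentile_metrics(incomes):
    """Split incomes into 100 equally sized groups and summarise each group.

    Hypothetical helper for illustration only: unweighted, and it assumes
    the number of households is a multiple of 100.
    """
    ordered = sorted(incomes)
    n = len(ordered)
    if n == 0 or n % 100 != 0:
        raise ValueError("this sketch needs a non-empty list whose length is a multiple of 100")
    size = n // 100
    rows = []
    for k in range(1, 101):
        # Households belonging to percentile K only
        group = ordered[(k - 1) * size : k * size]
        rows.append({
            "percentile": k,
            "mean": statistics.mean(group),
            "median": statistics.median(group),
            "minimum": group[0],
            "maximum": group[-1],
        })
    return rows


# Synthetic, evenly spaced incomes (invented values, not survey data)
incomes = [500 + 3 * i for i in range(10_000)]
metrics = percentile_metrics(incomes)

# Check the (K - 0.5)% implication for K = 13
k = 13
median_k = metrics[k - 1]["median"]
share_at_or_below = sum(x <= median_k for x in incomes) / len(incomes)
print(f"Share of households at or below the percentile-{k} median: {share_at_or_below:.1%}")
```

With distinct income values, as in this synthetic list, the share printed for K = 13 is 12.5%, matching the example given for the median.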

What caveats should I bear in mind when using this data?

This data should be used with caution as the relative standard errors are larger due to the higher resolution. For a full account of the relative standard errors for the survey as well as the detailed survey methodology, please refer to the HIES Technical Notes.

Publication(s) using this data

Metadata

Dataset description

Percentile-resolution household income data at state level.

Variable definitions
  • Date
  • State
  • Percentile
  • Variable
  • Income
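
The variables listed above suggest a long layout, with one row per date, state, percentile and metric. As a rough sketch under that assumption, the snippet below reshapes such a table so that each metric becomes its own column. The lowercase column names, the metric labels ("mean", "median", "minimum", "maximum") and all income values are assumptions made for illustration and should be checked against the actual file.

```python
import pandas as pd

# Tiny hand-made frame in the assumed long layout; the values are invented.
df = pd.DataFrame({
    "date": pd.to_datetime(["2022-01-01"] * 8),
    "state": ["Johor"] * 4 + ["Kedah"] * 4,
    "percentile": [50] * 8,
    "variable": ["mean", "median", "minimum", "maximum"] * 2,
    "income": [6000, 5990, 5900, 6100, 4500, 4490, 4400, 4600],
})

# One row per (date, state, percentile), one column per metric
wide = (
    df.pivot(index=["date", "state", "percentile"], columns="variable", values="income")
    .reset_index()
)

# Width of each percentile band, from its minimum to its maximum income
wide["band_width"] = wide["maximum"] - wide["minimum"]
print(wide)
```

The wide form makes it straightforward to compare metrics within a percentile, while the long form is usually more convenient for filtering on a single metric.
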
Last updated:

28 Jul 2023, 12:00

Next update:

N/A

Data source(s)
  • DOSM
License

This data is made open under the Creative Commons Attribution 4.0 International License (CC BY 4.0). A copy of the license is available here.

Download

Data
Full Dataset (CSV)

Recommended for individuals seeking an Excel-friendly format.

Full Dataset (Parquet)

Recommended for data scientists seeking to work with data via code.

Code

Connect directly to the data with Python.

# If not already installed, do: pip install pandas fastparquet
import pandas as pd

URL_DATA = 'https://storage.dosm.gov.my/hies/hies_state_percentile.parquet'

df = pd.read_parquet(URL_DATA)
if 'date' in df.columns:
    df['date'] = pd.to_datetime(df['date'])

print(df)

Sample OpenAPI query

The following code is an example of how to make an API query to retrieve the data catalogue mentioned above. The same query can be made from other programming languages by adapting the code accordingly. For a complete guide to the available query parameters and syntax, please refer to the official Open API Documentation.

import requests
import pprint

url = "https://api.data.gov.my/data-catalogue?id=hies_state_percentile&limit=3"

response_json = requests.get(url=url).json()
pprint.pprint(response_json)