How Human Health Is Measured: Key Metrics and Biomarkers

Measuring health sounds deceptively simple until someone points out that a marathon runner can have a resting heart rate of 40 beats per minute and be perfectly fine, while the same reading in a sedentary adult might warrant a cardiology referral. Health measurement is the science of turning something deeply personal — how a body functions, adapts, and ages — into numbers and patterns that can be tracked, compared, and acted upon. This page covers the major categories of health metrics and biomarkers used in clinical and public health settings, how they're interpreted, and where the important judgment calls happen.

Definition and scope

A health metric is any quantifiable indicator used to assess, monitor, or predict the state of biological functioning. A biomarker — a term formalized in clinical research — is a specific subset: a measurable biological characteristic that reflects normal processes, disease states, or responses to treatment. The National Institutes of Health Biomarkers Definitions Working Group defines a biomarker as "a characteristic that is objectively measured and evaluated as an indicator of normal biological processes, pathogenic processes, or pharmacological responses to a therapeutic intervention" (NIH Biomarkers Working Group).

The scope is wide. Metrics can be structural (bone density measured in grams per square centimeter), functional (lung capacity measured in liters via spirometry), behavioral (minutes of moderate physical activity per week), or biochemical (hemoglobin A1c as a percentage). Together, they form the measurement infrastructure behind everything from a routine annual physical to a large-scale national survey.

Physical health provides one of the clearest illustrations of this range: a single clinic visit might capture blood pressure, body mass index, fasting glucose, and a lipid panel — four completely different windows into the same body, each telling a different story.

How it works

Health measurement operates at two distinct levels: individual clinical assessment and population-level surveillance.

At the individual level, clinicians use a structured set of tools:

  1. Vital signs — blood pressure (normal: below 120/80 mmHg per American Heart Association guidelines), heart rate, respiratory rate, temperature, and oxygen saturation. These are fast, cheap, and informative.
  2. Anthropometric measures — height, weight, waist circumference, and derived values like body mass index (BMI). BMI is calculated as weight in kilograms divided by height in meters squared; the CDC classifies a BMI of 30 or above as obese (CDC BMI Classification).
  3. Laboratory biomarkers — blood, urine, and tissue analyses. The hemoglobin A1c test reflects average blood glucose over approximately 90 days; the American Diabetes Association sets a diagnostic threshold of 6.5% or above (ADA Standards of Care).
  4. Functional assessments — spirometry for lung function, grip strength dynamometry, echocardiography for cardiac output. These measure what the body can do, not just what it looks like chemically.
  5. Patient-reported outcomes — validated tools like the PHQ-9 for depression screening or the PROMIS (Patient-Reported Outcomes Measurement Information System) scales developed by NIH. These capture dimensions of health — pain, fatigue, emotional state — that no blood test can touch.

At the population level, agencies like the CDC and the National Center for Health Statistics aggregate individual measurements into surveillance datasets. The National Health and Nutrition Examination Survey (NHANES), for instance, combines physical examinations and laboratory tests from a nationally representative sample of roughly 5,000 participants per two-year cycle to track health metrics and indicators across the US population.

Common scenarios

The metrics used shift considerably depending on the clinical question.

Cardiovascular risk assessment draws on a cluster of biomarkers: LDL cholesterol, HDL cholesterol, triglycerides, blood pressure, fasting glucose, and increasingly, high-sensitivity C-reactive protein (hs-CRP) as a marker of systemic inflammation. The American College of Cardiology's Pooled Cohort Equations combine these into a 10-year atherosclerotic cardiovascular disease (ASCVD) risk score — a percentage that directly informs treatment thresholds. Cardiovascular health is one of the few areas where clinicians routinely use algorithmic risk scoring rather than single-value thresholds.

Diabetes monitoring centers on hemoglobin A1c but pairs it with fasting plasma glucose and, in some protocols, continuous glucose monitoring data expressed as "time in range" — the percentage of a 24-hour period that blood glucose stays between 70 and 180 mg/dL. Time-in-range targets vary by patient population but the International Consensus on Time in Range sets 70% as a minimum goal for most adults with Type 1 or Type 2 diabetes (Battelino et al., Diabetes Care 2019).

Mental health screening relies almost entirely on validated self-report instruments because no blood test diagnoses depression or anxiety disorder. The PHQ-9 scores nine items on a 0–27 scale; scores of 10 or above suggest moderate depression and typically prompt further clinical evaluation. This contrast — biochemical precision for metabolic disease versus structured questionnaires for mental illness — illustrates a fundamental asymmetry in measurement maturity across key dimensions of human health.

Decision boundaries

The line between "normal" and "abnormal" is less fixed than it appears on a lab report. Reference ranges are statistical constructs derived from population distributions — a value flagged as high or low simply means it falls outside the central 95% of a reference group. That reference group matters enormously.

Cholesterol thresholds, for example, have shifted repeatedly as cardiovascular outcome data accumulated. In 2013, the American College of Cardiology and American Heart Association moved away from treating to specific LDL targets toward risk-based treatment decisions — a significant philosophical shift in how a single biomarker is used clinically.

Age, sex, and population ancestry introduce further complexity. Glomerular filtration rate (GFR) equations used to estimate kidney function have been revised to remove race-based adjustments following a 2021 National Kidney Foundation and American Society of Nephrology task force recommendation, reflecting how deeply social and scientific factors intertwine in measurement practice. Health equity considerations are now formally embedded in how clinical decision thresholds are set and revised.

Biomarkers also require clinical context. A ferritin level of 12 ng/mL might confirm iron deficiency in a fatigued patient and be diagnostically unremarkable in someone with concurrent infection, because ferritin is also an acute-phase reactant that rises with inflammation — making it simultaneously a useful and misleading signal depending on circumstances. Understanding health risk factors and the conditions under which a biomarker was collected is as important as the number itself.

References