National Cancer Institute
dccps logo
Outcomes Research Branch
Cancer Control and Population Sciences

Brief Description of the Surveillance, Epidemiology & End Results (SEER)-Medicare Health Outcomes Survey (MHOS)-Level Analysis File

The following is a brief description about how the analysis file was created, who is included in the file, the structure of the file, and the content of the file. For ease of reference, variable names are included in parentheses.

The SEER-MHOS is a survey-level analysis file organized chronologically, based on the earliest survey administration date. The file includes all Medicare Advantage enrollees from Cohorts 1 to 8 that have completed at least one MHOS. Up to twelve records per person may be available in this file. This would occur for enrollees who were included in the MHOS sample every year, and responded to all eight baseline and follow-up surveys. The survey records are arranged chronologically starting with the earliest completed survey, and include the survey date (SRVDATE).

The survey-level file includes information for people with and without cancer. The variable (INSEER) identifies MHOS respondents who were found in the SEER file and were diagnosed with cancer between 1973 and 2007. Dates of diagnosis and clinical information, for up to 10 cancer sites, from the SEER file are included in each survey record that belongs to SEER-linked respondents. A number of variables were calculated to describe the timing of the survey relative to cancer diagnosis including the patient's cancer status at the time of the survey (CASTAT). These variables can be used to select a subset of respondents or surveys based on pre/post-cancer diagnosis; for example, selecting number of cancers before survey equal to 0 (NUMCABEF = 0) will keep only surveys prior to any cancer diagnosis. For more examples, refer to Sample SAS Code for Patient and Survey Selection. The SEER variables were copied from the SEER-Medicare file (PEDSF) and are described in Appendix A of the Data Dictionary.

The file also contains demographic information obtained from sources including the Enrollment Database (EDB) maintained by Centers for Medicare & Medicaid Services (CMS), SEER file, and self-reported information. Selected Census 2000 data at the census tract and zip code levels around the time of first cancer diagnosis were added to the survey record. In addition, zip code-level census data at the time of the survey were added to the record. SF-36 scales, as well as, physical and mental summary scores are provided on the T-score metric; for Cohorts 1-6 Baseline and Follow up Cohorts 7-8 Follow up.  For more information, refer to How to Use the SF-36 Measures. For 2006 survey years and later, physical and mental summary scores are provided based on the Veterans Rand (VR-12), but the SF-36 scales and T-scores are no longer available. In addition, VR-12 equivalent scores were calculated for Cohorts 1-6 Baseline and Follow up and Cohorts 7-8 Baseline.

For each data request, the file will be subset on respondents with the requested cancer site. Respondents may have more than one cancer site, and information for up to 10 sites is stored in an array for each SEER data item. The variable (CAINDX) indicates the number of the array element (1-10) which corresponds to the requested cancer site.


Footer begins
Last modified:
08 May 2012
Search | Contact Us | Accessibility | Privacy Policy  
Division of Cancer Control and Population Sciences National Cancer Institute Department of Health and Human Services National Institutes of Health USA.gov: The US government's official web portal