This is the final step, in which we aggregate each dataset by the MCD-group columns assigned in earlier stages and then merge the precinct-level electoral data with the STF3a census dataset. Both datasets are aggregated to the MCD-group level: for the precinct-level electoral data, this means aggregating precincts into MCD-groups; for STF3a, it means aggregating MCD units into MCD-groups. At the end of this step, the precinct-level electoral data and STF3a should have the same number of rows, one per MCD-group. The only tricky part is that not all variables aggregate the same way: most are simply summed, but those that are originally percentage-based cannot be summed and must be handled separately.
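The summing case and the row-count check can be sketched as follows. This is a minimal illustration in Python rather than the project's SAS code; the column names (`mcdgrp`, `votes_dem`, `votes_rep`, `pop`) and the sample values are invented for the example and are not taken from the actual datasets.

```python
from collections import defaultdict

def sum_by_group(rows, group_key, sum_cols):
    """Sum the given columns within each MCD-group; returns {group: {col: total}}."""
    totals = defaultdict(lambda: dict.fromkeys(sum_cols, 0))
    for row in rows:
        for col in sum_cols:
            totals[row[group_key]][col] += row[col]
    return dict(totals)

# Hypothetical precinct-level rows: several precincts may share one MCD-group.
precincts = [
    {"mcdgrp": "A", "votes_dem": 120, "votes_rep": 80},
    {"mcdgrp": "A", "votes_dem": 60, "votes_rep": 90},
    {"mcdgrp": "B", "votes_dem": 200, "votes_rep": 210},
]
# Hypothetical STF3a rows: one per MCD, several MCDs may share one MCD-group.
stf3a = [
    {"mcdgrp": "A", "pop": 1500},
    {"mcdgrp": "B", "pop": 900},
    {"mcdgrp": "B", "pop": 400},
]

elect_groups = sum_by_group(precincts, "mcdgrp", ["votes_dem", "votes_rep"])
census_groups = sum_by_group(stf3a, "mcdgrp", ["pop"])

# After aggregation both sides should hold exactly one row per MCD-group.
assert len(elect_groups) == len(census_groups)
assert set(elect_groups) == set(census_groups)
```

If the two sets of group keys differ, that would suggest an MCD-group assignment problem in an earlier stage rather than in the aggregation itself.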
Although the precinct-level electoral data are split into four years for each state, we decided to integrate all four into the final merged dataset for matching with STF3a, so that only one MCD-group-level dataset would result for each state. This was possible because the precinct-level electoral data variable names contain a two-digit year code, so each variable is unique across a state's four yearly datasets. There were, however, numerous identifier variables that were not unique. These were unimportant or nonsensical when aggregated to the MCD-group level, so we simply removed them before aggregating and merging.
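A rough sketch of this idea, again in Python rather than SAS: identifier columns are dropped first, and the yearly datasets are then combined into one row per MCD-group. The names `seqno`, `dem84`, and `dem86` are hypothetical stand-ins for a non-unique identifier and for year-coded variables. For brevity the sample rows are assumed to be already aggregated to one row per MCD-group per year.

```python
def drop_columns(rows, cols):
    """Return copies of the rows without the named identifier columns."""
    return [{k: v for k, v in row.items() if k not in cols} for row in rows]

def merge_years(year_datasets, group_key):
    """Combine yearly datasets into one record per MCD-group."""
    merged = {}
    for rows in year_datasets:
        for row in rows:
            target = merged.setdefault(row[group_key], {})
            for col, value in row.items():
                if col == group_key:
                    continue
                if col in target:
                    # Year-coded names should make this impossible.
                    raise ValueError(f"variable name collision: {col}")
                target[col] = value
    return merged

# Hypothetical yearly data; "seqno" is an identifier that repeats across years.
y84 = [{"mcdgrp": "A", "seqno": 1, "dem84": 180},
       {"mcdgrp": "B", "seqno": 2, "dem84": 200}]
y86 = [{"mcdgrp": "A", "seqno": 7, "dem86": 150},
       {"mcdgrp": "B", "seqno": 8, "dem86": 190}]

cleaned = [drop_columns(year, {"seqno"}) for year in (y84, y86)]
merged = merge_years(cleaned, "mcdgrp")
```

Raising an error on a name collision is a design choice of this sketch: it makes a leftover non-unique identifier visible instead of letting one year silently overwrite another.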
This step involves dealing with aggregation issues for variables in the STF3a dataset that cannot simply be summed.
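The text here does not say how the percentage-based variables are actually aggregated. One common approach, shown purely as an assumption, is a weighted mean in which each MCD's percentage is weighted by the base count it was computed from. The column names `pct_urban` and `pop` are hypothetical.

```python
from collections import defaultdict

def weighted_percent(rows, group_key, pct_col, weight_col):
    """Aggregate a percentage column to MCD-group level as a weighted mean.

    Assumption: weight_col holds the base count behind each percentage.
    Groups whose total weight is zero get None.
    """
    numerator = defaultdict(float)
    denominator = defaultdict(float)
    for row in rows:
        group = row[group_key]
        numerator[group] += row[pct_col] * row[weight_col]
        denominator[group] += row[weight_col]
    return {
        group: (numerator[group] / denominator[group]
                if denominator[group] else None)
        for group in denominator
    }

# Hypothetical MCD-level rows.
mcds = [
    {"mcdgrp": "A", "pct_urban": 10.0, "pop": 100},
    {"mcdgrp": "A", "pct_urban": 30.0, "pop": 300},
    {"mcdgrp": "B", "pct_urban": 50.0, "pop": 200},
]

group_pct = weighted_percent(mcds, "mcdgrp", "pct_urban", "pop")
```

In this sample, summing the two percentages for group A would give 40, and an unweighted average would give 20; the weighted mean gives 25 because the larger MCD counts for more.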
What should be left over:
Use the program mg_xx.sas to produce the final dataset mg_xx.sd2.
What should be left over:
Copyright © 1997-2004