Application Series 17


LABOUR FORCE SURVEY ­ CERTAINLY SIR

Mike Staley of the UK Office of Population Censuses and Surveys (OPCS) talks about SIR's use in Labour Force Surveys.

Background

The Labour Force Survey (LFS) is conducted by OPCS on behalf of the Employment Department and has existed in various forms since the UK entry into the European Community. It collects data relating to employment and, using the processed results, monitors employment patterns on a national scale. The results of the LFS are extremely accurate and constitute an increasingly useful tool for formulating policy decisions.

In the early years LFS was a biennial affair processed on an ICL mainframe. After evolving to an annual survey, it has become a quarterly, panel-based survey sampling over 60,000 addresses each quarter and using a PC based interactive questionnaire interviewing and collection package, Blaise. "Gentlemen, we have seen the future and it works..."

We provide the Employment Department with primary data, derived variables and abstract the sample to the total population (a process known as grossing). This work is done in SIR. An efficient Blaise-SIR interface has proved vital and we have devised a method of generating SIR database structure in Blaise metadata files. This has enabled us to maintain the same data shape and simply load the data into SIR - no fuss, no mess.

Analysis

The LFS derives three sets of variables: at the person, family and household levels. Those for the household and family have proved very stable; by and large, they are simple in concept. Those at the person level are a bit more interesting; besides which, we are here down to the basic unit, the data level which is of most interest to the Employment Department.

There are around seventy person derived variables, many of them involving searches of look-up tables in order to classify a respondent's occupation or industry code, or place of work. Some of these tables are small, and accommodated as temporary arrays held in SIR's workspace when being calculated. Larger ones have been designated as separate record- types to reduce the memory demands of the process.

Grossing, the process of extending the sample to the total population, is simple in concept, but can be bewildering in terms of the scale and number of the processing operations which have to be gone through. The whole operation is performed in tabfiles, which lends itself to this sort of high-speed data manipulation. tablespace limitations have forced some programs to be divided into modules.

Output

For data processing, the hierarchical structure of the data - persons within households within quota - has to be adhered to. Once processing has finished, however, this structure is no longer necessary. Various outputs are provided, such as SPSS, SAS, SIR export and ASCII files, as required by Employment Department clients. These different types of outputs are all available through SIR and frequently produce a flat file, consisting of one record type only.

If we write all of the variables which we need for these outputs to a single file, we can cut out the need to wander hither and thither through a hierarchical database. When there are several outputs to be run off the same database, this is definitely worthwhile.

SIR SAVE FILE

The use of the SIR SAVE FILE utility has proved invaluable in the generation of LFS output. It produces a flat-file database, consisting of one record type only. It can be used to efficiently generate a collection of outputs as mentioned above.

The Employment Department requirements increase continually as we continue to deliver the goods. Much of these extra requirements involve tabulating the data by other area definitions - for instance, in terms of travel to work areas, or rural district cooperation, or counties. This is easy enough, given the definitions of the new areas in terms of our standard areas.

We output a SIR SAVE FILE database consisting only of those variables which we need for this particular work, and run the tables in that. This provides significantly quicker access. For example, a job which takes three hours or more in the full hierarchical database, may take a mere ten minutes in the subset.

Flexibility

Although the SIR SAVE FILE produces a flat-file database, the structure can be augmented if necessary. For purposes of grossing on the LFS, we use only four variables. Recently it became necessary to restructure our grossing suite to meet revised targets; the option used involved regrouping the person records, and introducing a dummy household record to control the processing. We generated our flat-file database via SIR SAVE FILE and introduced a new record type, running the necessary record schema. This has worked splendidly, as indeed it should.

We are able therefore to combine the speed advantage of SIR SAVE FILE with the structural advantage of a modicum of hierarchy. The resulting hybrid is far from being a Frankenstein's Monster, and runs considerably faster than the original hierarchical database.

Conclusion

Essentially, the Labour Force Survey is a fast moving survey. Being so much larger than formerly, the results are increasingly useful to the Employment Department, and their vision of its possible application are - like the universe - in a state of perpetual expansion. Our processing techniques have to keep changing, as a consequence. It is an invigorating challenge to keep up with the demands - to regard the present methods of processing as being in a permanent state of review. SIR has been used to process the data for a good many years now, throughout the various recensions of the Survey. In that time, though, SIR has changed a good deal - but more importantly, we have been making more and more optimum use of SIR.

For more information on the use of SIR at the OPCS contact:

Michael Staley
OPCS
St Catherine's House
10 Kingsway
London WC2B 6JP UK

Back Sir Home