Académique Documents
Professionnel Documents
Culture Documents
judgment or bias. They contain multiple-choice and true-false items and scoring is a mechanical process that requires no special training or knowledge.
Objective tests: tests for which the scoring process is free of personal
Subjective tests: tests that contain essay questions; the scoring process can
be influenced by the personal characteristics and attitudes of the scorer. Test Norms
To interpret the results of a psychological test, a frame of reference or point of comparison must be established so that the performance of one person can be compared with the performance of others. This is accomplished by means of test norms: the distribution of test scores of a large group of people similar in nature to the job applicants being tested. The scores of this group, called the standardization sample (the group of subjects used to establish test norms), serve as the point of comparison for determining the relative standing of the persons being tested. E.g. a high school graduate applies for a job that requires mechanical skills and achieves a score of 82 on a test of mechanical ability. This score alone tells us nothihg about the level of the applicants skill, but if we compare that score of 82 with the test norms the distribution of scores on the test from a large group of high school graduates then we can ascribe some meaning to the individual score. If the mean of the test norms is 80 and the standard deviation is 10, we know immediately that an applicant who scores 82 has only an average or moderate amount of mechanical ability. With this comparative information, we can evaluate objectively the applicants chances of succeeding on the job relative to the other applicants tested. The most widely used psychological tests have sets of norms for different age groups, races, sexes, and levels of education. Reliability Reliability: refers to the consistency or stability of a response on a test. If a group takes a cognitive ability test one week and achieves a mean score of 100, and repeats the test a week later and achieves a mean score of 72, there is sth wrong. We would describe the test as unreliable bacause it yields inconsistent measurements. It is common to find slight variation in test scores when a test is retaken at a later date, but if the fluctuation is great, it suggests that sth is amiss with the test or the scoring method.
involves administering a new test twice to the same group of subjects and correlating the two sets of scores. The closer the correlation coefficient (called in this case the reliability coefficient) approaches a perfect positive correlation of +1.00, the more reliable the test is considered to be. This method has limitations: it is uneconomical to ask employees to take time from their jobs to take the test twice. Also, the effects of learning (remembering the questions) and the influences of other experiences between the 2 testing sessions may cause the group to score higher the second time.
involves administering similar forms of a new test to the same group of subjects and correlating the two sets of scores. Disadvantage: it is difficult and costly to develop two separate and equivalent tests.
administering a new test to a group of subjects, dividing in half the total number of items, and correlating the two sets of scores. Less time-consuming than the other methods bacause only one administration of the test is required. There is no opportunity for learning or recall to influence the second score. In choosing a test for employee selection, the reliability coefficient ideally should exceed +.80, although, in practice, a coefficient of approximately +.70 is considered acceptable. Validity It is the most important requirement for a psychological test or any other selection device. Validity: the determination of whether a psychological test or selection device measures what it is intended to measure. Several different kinds of validity.
Rational Validity: The type of validity that relates to the nature, properties,
2 approaches to establishing the rational validity of a test are: content validity and construct validity.
In content validity, the test items are assessed to ensure that they
adequately sample the knowledge or skills the test is designed to measure. This assessment can be accomplished by conducting a job analysis and determining if the test items are related to all those abilities needed to perform the job. E.g. if your professor says you will be tested on the first 3 chapters of the book, questions about information from other chapters would not be considered content valid).
If tests no longer require expensive validation procedures for every job in every company, then organizations can improve their selection programs while saving time and money. Psychological testing and fair employment practices Increase in validity research to document whether a test is discriminatory > if studies show that applicants of all races who score below a certain level on a test perform poorly on the job, then the test is not discriminatory by race. Research confirms that tests with high validity coefficients have relatively low levels of adverse impact. Criterion-related validation procedures (correlating test scores with job performance measures) are required when possible. A test found to be valid for one group is expected to be valid for another. Racial differences among average test scores are not the result of test bias but arise from educational, social and cultural differences. Differences shown mainly for cognitive ability tests. Race banding: a controversial practice of grouping test scores for minority job applicants to equalize hiring rates. E.g. a companys psychologist supervising the employee selection program could examine the test scores and decide to band or group toogether all applicant scores between, say, 91 and 100, calling the range of scores band 1. Band 2 would encompass all socres between 81 and 90, band 3 the scores between 71 and 80 and so on. For the purpose of hiring equality by race, then, all applicants included in band 1 would be considered equal in terms of the ability being tested. No distinction would be drawn between the applicant with a score of 100 and the applicant with a score of 92. Thus, all applicants in band 1 would be eligible for selection. Further, the order of selection within a band would not necessarily be by test score but rather could be by race. So, a firm might choose to select members of a minority group in the interests of diversity. But, the result might be that only members of the minority group are screened in if the number of openings to be filled is equal to or less than the number of minority group members in the top band. Testing for disabled persons > Many psychological tests have been adjusted or modified for disabled persons. For visually handicapped persons, test questions can be presented orally, in large print, or in braille, and people can be allowed more time in which to complete a test. Test questions relating to color, shape and texture cannot be used with persons who were born without sight and have never seen colors and objects. Applicants with hearing disabilities can be given written instead of oral test instructions.
Establishing a testing program The first requirement is to investigate the nature of the job for which testing will be used. Once job and worker analyses have been conducted, the appropriate tests to measure the behaviors and abilities necessary for job success must be carefully chosen or developed. I/O psychologists have to find appropriate tests to test these abilities. They can either use tests already available or develop new tests specifically for the job and company in question. The best tests include information on reliability, validity and test norms. There are several issues to consider in deciding whether to use a published test or develop a new one > Cost is always important: it is less expensive to purchase a test than to construct one, especially if a small number of employees are to be selected. Time is also important. If the organization needs qualified workers as soon as possible, management may be unwilling to wait for a test to be developed. If an organization decides to develop its own test for a particular job, the I/O psychologists (internal or external) must prepare a list of suitable test items and then validate the test to determine whether it measures what it is supposed to measure. An item analysis will be conducted to evaluate how effectively each test item discriminates between those who score high on the total test and those who score low. This evaluation involves correlating a persons response on each item with the response on the test as a whole. A perfectly valid test question is one that was answered correctly by everyone who who scored high on the complete test and was answered incorrectly by everyone who scored low. Only items with a high correlation coefficient will be retained for the final version of the test. The level of difficulty for each item must be determined. Not all items must have the same level of difficulty. Then the predictive validity of the test must be determined. That is, the scores on the test must be correlated with actual performance on the job. The more common method is to test workers already on the job. The content validity of a test should be also demonstrated: it must be shown that the items deal with abilities that are directrly related to the job. Once validity and reliability of the new test have been found to be satisfactory, a cutoff score must be set. No one scoring below this level will be hired. The cutoff score depends partly on the available labor supply. The greater the number of applicants, the more selective a company can be. The higher the cutoff score, the higher the quality of the applicants hired. Most of the procedures for establishing cutoff scores involve job analyses and criterionrelated validity studies to determine the minimum acceptable level of job performance. One frequently used technique asks subject matter experts to assess the probability that a minimally competent pesron would answer each test item correctly. Types of Psychological Tests
__________________________________________________________________________________ _ Chapter 4 / Page 7 of 14
I/O psychologists categorize tests in 2 ways: 1. how they are constructed and administered, and 2. the skills and abilities they are designed to measure. Individual and Group Tests Group tests: tests designed to be administered to a large number of people at the same time. They can be given to 20, 200 or 2,000 applicants simultaneously. The only limitation is the size of the testing facility. Individual tests: psychological tests designed to be administered to one person at a time. More costly than group tests > used less frequently, usually only for selecting senior management personnel. More popular for vocational guidance and counseling and for diagnostic work with patients. Computer-Assisted Tests Computer-assisted tests: a means of administering psychological tests to large groups of applicants in which an applicants response determines the level of difficulty of succeeding items. They are designed for large scale group testing. Each applicant takes the test at a video display terminal > sometimes called tailored testing because the test is tailored or adapted to the person taking it. In computer-assisted testing you dont have to waste time answering questions below your level of ability. The computer program begins with a question of average difficulty. If you answer correctly, it proceeds to questions of greater difficulty. If not, it asks you less difficult questions. Advantages: - Testing can be done at any time in the selection process. - It is not dependent on finding a qualified test administrator and scheduling a testing session. - Because a range of abilities can be measured in a relatively short time, there is less opportunity for the applicants interest and motivation to diminish. - Fatigue and boredom are reduced, and immediate results are available to the human resources department.
Comparisons of the same tests given in paper-and-pencil format and computeradministered versions show little difference in the resulting scores. Research has shown that most ppl react favorably to computer-assisted testing. On personality tests, they tend to provide more accurate information. For high-level managerial jobs, the company may have an I/O psychologist prepare a summary of the test scores and, based on those scores, a description and judgment of the applicants abilities. Computer-generated reports are also available. Computerized
__________________________________________________________________________________ _ Chapter 4 / Page 8 of 14
test interpretation eliminates the possibility of personal bias in evaluating the test results. Speed and Power Tests The difference between the 2 is the time allotted for completion of the tests. Speed tests: tests that have a fixed time limit, at which point everyone taking the test must stop. Power tests: tests that have no time limit; applicants are allowed as much time as needed to complete the test. A power test often contains more difficult items than a speed test. Large-scale testing programs often include speed tests because all test forms can be collected at the same time. For some jobs, working speed is an important component of successful performance. A power test would not be able to evaluate this skill properly. Paper-and-Pencil and Performance Tests Paper-and-pencil tests: psychological tests in printed form; answers are recorded on a standard answer sheet. Most often and most standard group tests of cognitive abilities, interests and personality belong to this category. Some behaviors or characteristics are not so easily evaluated by paper-and-pencil means. E.g. the mechanical skills of applicants applying for jobs with an appliance repair company are better tested by having the applicants perform the appropriate mechanical operations than by having them answer questions about the nature of those operations. Performance tests: the assessment of complex skills, such as word processing or mechanical ability, for which paper-and-pencil tests are not appropriate. For the assessment of more complex skills, expensive equipment may be needed. Tests of Knowledge, Skills and Abilities Distinction in terms of the characteristics or behaviors they are designed to measure. The basic categories are tests of: cognitive abilities, interests, aptitudes, motor skills, and personality. Cognitive Abilities
Cognitive ability tests (intelligence tests): tests of intelligence or mental ability. Widely used for employee selection > e.g. a survey of personnel practices in departments of law enforcement found that 92% used intelligence to select police officers (Ash, Slora & Britton, 1990). Group intelligence tests, the kind given most often, are a rough screening device. They are brief, take little time to complete and can be administered to large groups. They are rapidly and easily scored by hand or machine. Research has shown that cognitive ability tests are highly valid for predicting success in job training programs and in actual job performance. They correlate highly with measures of job performance such as supervisor ratings, production quantity and quality data, and work samples. Also, the high predictive validities of cognitive ability tests, have been shown to remain stable up to 5 years. Also, cognitive ability is a valid predictor of employee mobility > A study of more than 11,000 employees between the ages of 24 and 30 found that over a 5-year period, ppl tended to gravitate to jobs that suited their level of cognitive ability, whether high or low (Wilk, Desmarais & Sackett, 1995).
Interests
Interest tests: psychological tests to assess a persons interests and preferences; used primarily for career counseling. Include items about daily activities from among which applicants select their preferences. The rationale is that if a person exhibits the same pattern of interests and preferences as people who are successful in a given occupation, then the chances are high that the person will find satisfaction in that occupation. It is important to note that just because a person shows a high degree of interest in a particular job, there is no guarantee that s/he has the ability to be successful in that job. What interest tests show is that the persons interests are compatible with the interests of successful ppl in that career. If a test shows that a person has no interest in a field, then the chances of succeeding in it are limited. One problem with using them for employee selection is the possibility of faking of responses.
Aptitudes
Aptitude tests: tests to measure specific abilities such as mechanical or clerical skills. May measure perceptual speed and accuracy, attention to detail, the capacity to visualize and manipulate objects in space, principles of mechanical operation, ability to operate computers.
Motor Skills
Motor skills tests: tests to measure abilities involving muscle coordination, finger dexterity and eye-hand coordination. Many jobs in industry and the military require abilities. These tests may simulate assembly-line conditions.
Personality
Certain personality characteristics can contribute to job satisfaction and job performance. E.g. empathy and nurturance are important traits for counselors to have; autonomy and persistence are vital to investigative reporters; and orderliness and precision are needed by accountants. Personality tests: assessments of personal traits and feelings. Personality test scores have been found to correlate with job success. Their predictive validities can be as high as those for assessment centers. They are popular selection techniques and their use is expected to increase. 2 approaches to personality assessment are self-report inventories and projective tests.
questions dealing with situations, symptoms and feelings; applicants are asked to indicate how well each item describes themselves or how much they agree with each item.
Honesty is the main difficulty. Research suggests though that few job applicants deliberately distort their responses in a socially desirable direction and that validity measures remain stable regardless of any possible distortion. Predictive validities for self-report personality inventories are generally in the low to moderate scale, but using them to assess the so-called Big Five factors of personality has shown much higher validities. These basic factors have been identified as extraversion, agreeableness, conscientiousness (or the will to achieve), neuroticism and openness to experience. 2 of those factors, conscientiousness and extraversion, have been found to be particularly effective in predicting job performance.
__________________________________________________________________________________ _ Chapter 4 / Page 11 of 14
Conscientiousness includes such characteristics as being responsible and dependable, able to plan, organized and achievement-oriented. Etxraversion, which includes the characteristics of sociability, talkativeness, ambition, assertiveness and high activity level, correlates highly with success for salespersons and managers. A third factor, openness to experience, has been found to predict training proficiency for all tested occupations. Studies of managers who make hiring decisions confirm that conscientiousness (along with general mental ability) is the most important attribute related to hirability.
The task is to tell what is seen in the figure or picture. The rationale is that people will project their thoughts, wishes and feelings onto the stimulus in an effort to give it meaning. These tests cannot be faked because there are no right and wrong answers. They are time-consuming and must be administered individually. Examiners must be thoroughly trained and experienced. Because few of the tests have objective scoring keys, there is ample opportunity for subjective bias to affect the scoring and interpretation of the results. Although projective techniques are used for employee selection at the executive level, research shows that they have low validity.
Integrity tests
Integrity tests: paper-and-pencil tests to predict and detect employee dishonesty. There are 2 types of integrity tests: overt integrity tests, which directly assess attitudes toward theft, and personality-oriented integrity tests, which measure general delinquency, impulse control and conscientiousness. Both types of test appear to be valid predictors of theft and of such counterproductive behaviors as absenteeism, drug abuse and malingering. They have also been found predictive of general job performance as assessed by supervisor ratings.
Ethical issues: APA > 1. Test users: persons who administer and interpret psychological tests should be
aware of the principles of psychological measurement and validation and the limitations of test interpretation. They must avoid bias and should always consider more than one means of assessment. They must adhere to standardized test administration procedures and make every effort to achieve accuracy in recording and scoring test results. 2. Test security: actual test questions should never be reprinted in a public medium such as a newspaper or magazine. It is permissible to publish sample questions (similar to real questions) but not items used in the actual scoring of a test. Tests should be sold only to professionals who will safeguard their use. 3. Test interpretation: test scores should be given only to those qualified to interpret them. They should not be given to anyone outside the human interpret
__________________________________________________________________________________ _ Chapter 4 / Page 13 of 14
department, such as the applicants potential supervisor, unless the supervisor has the training necessary to interpret the scores. The person being tested has the right to know the test score and what it means. 4. Test publication: tests should not be released for use without adequate background research to support the test developers claims. Informative and current test manuals containing reliability, validity and normative data should be made available. Advertisements should describe tests accurately, without emotional or persuasive appeals.