|
1
|
- Job Analysis- Reed A. Castle, PhD
- Item Writing- Steven S. Nettles, EdD
- Test Development- Julia M. Leahy, PhD
- Standard Setting- Paul D. Naylor, PhD
- Scaling/Scoring-Lauren J. Wood- PhD, LP
- 5 topics, 20 minutes and 20 minutes Q&A
|
|
2
|
- Reed A. Castle, Ph.D.
- Schroeder Measurement Technologies, Inc.
|
|
3
|
- An investigation of the ability requirements that go with a particular
job (Credentialing Exam Context).
- It is the study that helps establish a link between test scores
and the content of the profession.
- The Joint Technical Standards14.14
- “The content domain to be covered by a credentialing test should
be defined clearly and justified in terms of importance of the content
for the credential-worthy performance in an occupation or profession.
A rationale should be provided to support a claim that the knowledge
or skills being assessed are required for credential-worthy performance
in an occupation and are consistent with the purpose for which the
licensing or certification program was instituted.”
|
|
4
|
- Need to establish a validity link.
- Need to articulate a rationale for examination content.
- Need to reduce the threat of legal challenges.
- Need to determine what is relatively important practice.
- Need to understand the profession before we assess it.
|
|
5
|
- Focus Group
- Traditional Survey-Based
- Electronic Survey-Based
- Transportability
|
|
6
|
- Need to Identify the best group of SMEs possible
- Areas of Practice
- Geographic representation
- Demographically Balanced
- 8 to 12
|
|
7
|
- Prior to Meeting-
- Comprehensive review of profession
- Job Descriptions
- Performance Appraisals
- Curriculum
- Other job-related documents
- Create a Master Task List
- Send list to SMEs prior to meeting to give them chance to review
|
|
8
|
- At Meeting-
- Review Comprehensive Task List
- Determine which tasks are important
- Determine which tasks are performed with an appropriate level
of frequency
- Determine which tasks are duplicative
- Identify and add missing tasks
- Organize into coherent outline
|
|
9
|
- Advantages-
- May be only solution for new/emerging professions
- Relatively quick
- Less expensive
|
|
10
|
- Disadvantages
- Based on one group (Results may not generalize)
- May be considered a weaker model when considering validation.
- May result in complaints from constituents about the content
of the test.
|
|
11
|
- First steps are similar to the focus group (i.e., task list is
generated in same manner)
- After the task list is
created, three more issues must be addressed to complete the first
survey development meeting.
|
|
12
|
- First, demographic questions must be developed with two goals
in mind.
- Questions should help describe the sample of respondents
- Some Question will be used for analyses
help generalize across groups (e.g., geographic regions)
|
|
13
|
- Second, rating scale(s) should be developed.
- Minimally, two pieces of information should be collected
- Importance or significance
- Frequency of performance
- Additional scales can be added but may take away from response
rate.
- Shorter is sometimes better.
|
|
14
|
- Sample Scale combining Importance and Frequency
- High correlation b/w Freq and Imp Ratings (.95 and higher)
- Considering both the importance and frequency, how important is
this task in relation to the safe, effective, and competent performance
of a Testing Professional? If you believe the task is never performed
by a Testing Professional, please select the 'Not performed' rating.
- 0 = Not performed
- 1 = Minimal importance
- 2 = Below average or low importance
- 3 = Average or medium importance
- 4 = Above average or high importance
- 5 = Extreme or critical importance
|
|
15
|
- Sampling-
- One of the more important considerations is the sampling model
employed.
- Surveys should be distributed to a sample that is reflective of
the entire population.
- Demographic questions help describe the sample.
- One should anticipate a low response rate (20%) when planning
for an appropriate number of responses.
|
|
16
|
- Mailing Surveys
- Enclose a postage paid return envelope.
- Plan well in advance for international mailings (can be logistically
painful with different countries).
- When bulk mailed, plan extra time.
- Keep daily track of return volume.
|
|
17
|
- Identical to traditional, but delivery and return are different.
- Need Email addresses.
- Need profession with ready access to Internet.
|
|
18
|
- Advantages
- Faster response time.
- Data entry is no longer needed.
- Reduced processing time on R & D side.
- Possibly less expense (less admin costs).
- Can modify sampling and survey on the fly if needed
- Sample can be the population with little additional cost.
|
|
19
|
- Disadvantages
- Need Email addresses
- High rate of “bounce-back”
- Control for ballot stuffing
- Data compatibility
|
|
20
|
- Using the results of other job analysis
- Determine compatibility or transportability
- Similar to Focus Group
|
|
21
|
- Focus Group
- Traditional Survey-Based
- Electronic Survey-Based
- Transportability
|
|
22
|
- Demographics
- Importance Ratings
- Frequency Ratings
- Composite
- Sub group Analyses
- Decision Rules
- Reliability
- Survey Adequacy
|
|
23
|
- Geographic Region
- Years Experience
- Work Setting
- Position Role/Function
- Percent Time in certain activities
|
|
24
|
|
|
25
|
|
|
26
|
- Composite ratings using rating scale Natural Logs (when multiple
scales are used) can be calculated and combined based on some weighting
scheme.
- For example, if you want to weight frequency 33.33% and importance
66.66%, you can adjust for this in the composite rating equation.
- Personal opinion is that you will likely end up in a very similar
place if establishing decision criteria on each scale individually.
- In addition, multiple decision rules is more conservative
|
|
27
|
|
|
28
|
- SMEs are asked to determine which assessment type will best measure
a given task
- Multiple choice
- Performance
- Essay/short answer
|
|
29
|
- Each task on the content outline requires some level of cognition
to perform
- 3 basic levels exist (from Bloom’s Taxonomy)
- Knowledge/Recall
- Application
- Analysis
- Steve will discuss in next presentation
|
|
30
|
- Of the remaining tasks-post inclusion decision criteria, SMEs
are asked to rate them on a 3 point scale
- For each major content area, an average rating is calculated
- The average is applied to specific criteria to determine the number
of items by cognitive level for each content area
|
|
31
|
- Weighting is usually done with SME’s based on some type of data
- For example, average importance or composite rating for a given
content area
- Applied to assessment type and cognitive levels.
|
|
32
|
- Standard Exclusion/Inclusion criteria
- Test Specifications
- Assessment type/Cognitive levels
- Weights based on rational approach
- Reflect test-type
- Statistical
- Consensus
|