The Workforce Evidence-Based Spending Guide

Strategy 1: Define Evidence of Effectiveness

2. Prioritize Evidence 3. Link Payments to Outcomes 4. Use Active Contract Management 5. Build Evidence Through Evaluations 6. Braid, Blend, or Sequence Funding

Why It Matters

Clearly defining “evidence of effectiveness” is a key step in including evidence standards in government contract and grant applications. By determining what counts as evidence, workforce development practitioners can evaluate and score proposals against those standards and set clear expectations for applicants.

In a broad sense, evidence of effectiveness refers to all the evaluations, analyses, and other work showing how well a program or policy worked in achieving its goals. To make procurement and grant-making more evidence-based, workforce agencies will need a specific evidence framework, with corresponding equity considerations, that fits their unique contexts.

The Role of Evidence Definitions

Evidence definitions allow governments to more clearly and effectively shift dollars to what works and deliver better results for all through their grants, contracts, budgets and purchasing. RFA’s updated evidence framework encourages government decision-makers to consider the full body of evidence for a policy, program, or practice. This includes taking into account the local context and how it affects implementation and results, and focusing on why, how and for whom specific programs work. These definitions can also help governments incorporate valuable input from the community and build a culture of ongoing learning and improvement. By using evidence definitions, government decision-makers can direct public funding to solutions that are most likely to work in their communities.

Overview

An evidence framework or continuum is a tool to categorize practices, policies, and programs based on the number and types of rigorous evaluations that demonstrate policy-relevant results. These frameworks are not only useful for steering funds toward programs that are effective but can also help identify programs where more information is needed and where to invest funds in new evaluations.

Evidence frameworks generally contain the following:

A set of categories to reflect the rigor, quantity, and outcomes of an evaluation. Categories may be designated numerically or with descriptive titles such as high, medium, low or promising, theory-based, and mixed effects.
A definition for each category, usually including some quantitative indicators such as the number of evaluations, studies, or peer-reviewed reports. May also include timelines such as 1 year or 5 years of outcomes, as well as considerations for data disaggregation to understand equity within subpopulations.
A designation for new or emerging programs that may not yet have evidence but are important to the work of the agency.

Frameworks may also include supplementary documents such as instructions for category selection to be used by respondents or outcomes requirements that direct applicants as to what data to collect, track, and report and the specific methodology to use.

Evidence-Based Program

A program with either impact or implementation evidence that is relevant, credible, and has an informed rationale

Evidence-Building Program

A program that has an informed rationale and is undergoing an impact evaluation or implementation evaluation that is relevant and credible

RFA’s Evidence Definitions

Building on the strong foundation of evidence definitions already in use, Results for America is sharing new definitions of “evidence-based” and “evidence-building” that help pave the way for governments to more clearly and effectively shift dollars to what works and deliver better results for all.

Learn more

Examples

Evidence Frameworks Can Boost Performance

Evidence frameworks vary across the workforce field, as government agencies determine how many tiers they’ll use, the specific criteria for each, and the role that equity will play. As part of their participation in Results for America’s State and Local Workforce Fellowship, Pennsylvania, Colorado, and Texas each developed or strengthened evidence frameworks to inform their contracting and grantmaking processes.

Here’s an overview of the evidence continuums the Colorado Workforce Development Council and Pennsylvania’s Department of Labor and Industry have adopted as part of RFA’s State and Local Workforce Fellowship:

Pennsylvania and Colorado

Strong Evidence

Two or more rigorous evaluations support the program model

Moderate Evidence

At least one evaluation report has demonstrated that an intervention or strategy has been tested using a well-designed and well-implemented experimental or quasi-experimental design showing evidence of effectiveness on one or more key workforce outcomes. The evaluations should be conducted by an independent entity external to the organization implementing the intervention.

Preliminary Evidence

At least one evaluation report has demonstrated that an intervention or strategy has been tested using a well-designed and well-implemented pre/post-assessment without a comparison group or a post-assessment comparison between intervention and comparison groups showing evidence of effectiveness on one or more key workforce outcomes. The evaluation may be conducted either internally or externally.

Pre-preliminary Evidence

There is program performance data for the intervention showing improvements for one or more key workforce outputs or outcomes

See Colorado Workforce Development Council’s evidence continuum in action on pages 5-6 of this Reskilling, Upskilling, and Next-skilling Workers (RUN) grant, and review the Pennsylvania Department of Labor and Industry’s statewide Evidence of Effectiveness definitions.

After publishing the statewide Evidence of Effectiveness definitions with the Pennsylvania Department of Labor and Industry, the Pennsylvania Workforce Development Board conducted a field survey to understand more about local partners and how the definitions impacted them. The survey collected data from over 100 respondents across 12 workforce and education program types. Metrics included:

Self-assessed organizational/programmatic evidence rating (pre-preliminary, preliminary, moderate, or strong)
Organization’s current budget for evaluation
History of participating in external evaluations

Respondents also shared that staff expertise and capacity, data access, and non-staff related costs were the main barriers to building and using evidence, which could be mitigated through dedicated funds for evidence building within grant awards and improved technical assistance opportunities. This survey process ensured the state understood what barriers, supports, and resources would be required to equitably apply the Evidence of Effectiveness definitions.

Texas

The Texas Workforce Commission (TWC) chose to use a wider range of tiers beyond high, middle, and low in its evidence framework to include applicants with promising practices or the desire to build evidence of effectiveness through their work. TWC allows providers to receive funding for newer programs if they build evidence of effectiveness over the grant or contract period through improved data collection and evaluation. For a more detailed look at Texas’ Evidence Framework, click here.

High Evidence

At least two approved rigorous studies must show that the program produces positive and meaningful outcomes, with a high degree of confidence that the outcome is primarily caused by the program.

Moderate Evidence

Program must be supported by an approved rigorous evaluation which finds that the program has a positive and meaningful outcome, with a modest degree of confidence that the outcome is primarily caused by the program.

Performance

Program must provide historical output and outcome data for at least two years, along with assessments and post-program follow-up to demonstrate effectiveness.

Experience

Programs do not perform evaluations of participant success or collect data on the effectiveness of the program. Support for program effectiveness comes from anecdotal success stories or other testimonials.

New

Entirely new programs with no evidence of effectiveness or evaluation data. Applicants must explain why the program will achieve positive measurable outcomes and that there is sufficient capacity to collect data and track outcomes.

Other Strong Examples of State Evidence Frameworks

Minnesota Management and Budget Office Evidence Framework

The Results First Initiative within the state of Minnesota’s Management and Budget Office created an evidence framework to rate the state’s programs and services. In this framework, programs and services are considered evidence-based if they fall in the “Proven Effective” or “Promising” categories. Ratings from each program are publicized in the Minnesota Inventory, a quasi state evidence clearinghouse.

Note: For Minnesota, research includes “programs and services previously evaluated and featured in a national clearinghouse or meta-analysis (with respect to the nature, length, frequency, and target population.”

Proven Effective

A Proven Effective service or practice offers a high level of research on effectiveness for at least one outcome of interest. This is determined through multiple qualifying evaluations outside of Minnesota or one or more qualifying local evaluation. Qualifying evaluations use rigorously implemented experimental or quasi-experimental designs.

Promising

A Promising service or practice has some research demonstrating effectiveness for at least one outcome of interest. This may be a single qualifying evaluation that is not contradicted by other such studies but does not meet the full criteria for the Proven Effective designation. Qualifying evaluations use rigorously implemented experimental or quasi-experimental designs.

Theory Based

A Theory Based service or practice has either no research on effectiveness or research designs that do not meet the above standards. These services and practices may have a well-constructed logic model or theory of change. This ranking is neutral. Services may move up to Promising or Proven Effective after research reveals their causal impact on measured outcomes.

Mixed Effects

A Mixed Effects service or practice offers a high level of research on the effectiveness of multiple outcomes. However, the outcomes have contradictory effects. This is determined through multiple qualifying studies outside of Minnesota or one or more qualifying local evaluation. Qualifying evaluations use rigorously implemented experimental or quasi-experimental designs.

No Effect

A service or practice rated No Effect has no impact on the measured outcome or outcomes of interest. Qualifying evaluations use rigorously implemented experimental or quasi-experimental designs.

Proven Harmful

A Proven Harmful service or practice offers a high level of research that shows program participation adversely affects outcomes of interest. This is determined through multiple qualifying evaluations outside of Minnesota or one or more qualifying local evaluation. Qualifying evaluations use rigorously implemented experimental or quasi-experimental designs.

Tennessee

The state of Tennessee’s Office of Evidence and Impact evidence framework considers programs evidence-based if they are supported by at least one rigorous evaluation. This framework is used as part of a program inventory to identify how funding is allocated across different programs, the services provided, and any evidence tied to the program’s outcomes. For Tennessee, rigorous evaluations include randomized controlled trials and quasi-experimental designs that use comparison groups.

Strong Evidence

Two or more rigorous evaluations support the program model.

Evidence

At least one rigorous evaluation supports the program model.

Outcomes

Data collected over time demonstrate a change or benefit for participants.

Outputs

Process measures to support continuous improvement.

Logic Model

“If we do X, Y, and Z activities, then we expect to see A, B, and C results.”

New Mexico

In 2019, the state of New Mexico passed the Evidence and Research Based Funding Requests Act, which defined four tiers of evidence and required state agencies to categorize sub-programs according to these tiers and report on the amount allocated for each of these evidence tiers. Each year, New Mexico’s Legislative Finance Committee oversees this work, providing budget guidance for agencies, recommendations for evidence-based programs through their Legislating for Results framework, training, and technical assistance.

Evidence-Based

A program or practice: (1) incorporates methods demonstrated to be effective for the intended population through scientifically based research, including statistically controlled evaluations or randomized trials; (2) can be implemented with a set of procedures to allow successful replication in New Mexico; and (3) when possible, has been determined to be cost beneficial.

Research-Based

A program or practice has some research demonstrating effectiveness, but does not yet meet the standard of evidence-based.

Promising

A program or practice, based on statistical analyses or preliminary research, presents potential for becoming research-based or evidence-based.

Lacking Evidence of Effectiveness

Programs or practices that do not fall into the other three evidence tiers.

Sample Evidence Frameworks from Federal Agencies

The U.S. Department of Labor’s Clearinghouse for Labor Evaluation and Research (CLEAR), includes descriptive, implementation, and impact studies for workforce development and employment-related programs on a wide variety of topics. CLEAR rates the evidence presented in impact studies as high, moderate, or low depending on how confident they can be that the study outcomes are attributable to the program. Review CLEAR’s rating criteria here.

The U.S. Department of Education’s What Works Clearinghouse (WWC) reviews and summarizes studies of education programs – including postsecondary career and technical education programs – and assigns those programs to strong or moderate evidence tiers.

AmeriCorps has a common evidence framework for funding decisions in the Senior Corps and AmeriCorps state and national programs, including pre-preliminary, preliminary, moderate, and strong evidence tiers. In FY22, 64% of competitively awarded funds were invested in interventions with moderate and strong evidence. In 2023, thirty states used this federal definition of evidence in their most recent AmeriCorps grant applications.

RFA’s Evidence Honor Roll

Get access to a catalog of programs that define and prioritize evidence of effectiveness with RFA’s “Honor Roll of State Grant Programs that Define and Prioritize Evidence.” State agencies can nominate their state grant program(s) for potential inclusion using this form. Nominations and updates the honor roll on a rolling basis.

Go Deeper

Embedding Equity

Equity Considerations for Evidence Frameworks

Using evidence and data without an equity lens can perpetuate disparities despite our best intentions. Simply delivering more services to historically underserved populations does not necessarily address the ongoing gap in available evidence about what works within these populations. To use an equity lens within an evidence framework necessitates building evidence and investing in programs that work for specific populations, including those that are historically underserved. While the explicit incorporation of an equity lens into the development and use of a framework is newer in its application, it is no less critical to ensuring that high quality solutions not only have the necessary data to demonstrate that they are successful, but also that data is disaggregated to address disparities in outcomes. Below are three ways workforce development agencies can build equity into their evidence frameworks.

Prioritize research and evaluation

Prioritize and fund research and evaluation approaches that empower workers and communities to be full partners in the research, from designing the research and interpreting data to shaping how findings are used. Funding evidence building through impact and implementation evaluations can help to reduce the gap between evidence that is currently available and more comprehensive evidence that will enable workforce agencies to understand what works, for whom, and under what circumstances.

Community-Based Participatory Action Research (PAR) is an umbrella term for research approaches that engage community members as full partners at every stage of the research process, including shaping research questions and empowering community members to collect and analyze data. PAR disrupts historical power imbalances between researchers and communities, generating better research and knowledge. This approach recognizes that deep expertise lies within each community, making research done without community input incomplete and inaccurate. The Conservation Law Foundation’s PAR Field Guide includes guidance on setting up a PAR project, developing research questions and tools, and collecting and analyzing data. The Healthy Neighborhoods Study is a PAR project examining the impact of gentrification on the health of residents in nine communities in the metropolitan Boston area. Residents and community groups drive the study in partnership with researchers and regional planners.

The Urban Institute’s Community Voice and Power Sharing Guidebook offers practical advice on partnership building, community advisory boards, community-engaged survey development, and youth engagement. Urban also has a guide for holding Data Walks, in which program administrators and service providers empower program participants and community members by sharing program outcome data or research findings with them. Data Walks give community members an opportunity to ground the data in their lived experiences, shedding light on how and why programs and services are serving their community well or not. The guide includes detailed descriptions of how the Urban Institute used Data Walks to share data with public housing residents in Chicago, Illinois and Portland, Oregon as part of a demonstration project testing innovative wraparound services for those residents. The Urban Institute team has also used a Data Walk in Washington, DC to gather community input on the design of its Promoting Adolescent Sexual Health and Safety project.

Compensate community input

Embed equity in the evidence frameworks by seeking out and compensating the input of those who will be most impacted by programs, such as jobseekers (particularly jobseekers from historically marginalized groups), neighborhood groups in historically disadvantaged areas, community-based organizations, and community-based training providers.

In Training and Employment Guidance Letter No. 21-22, Increasing Equitable Service Access and Employment Outcomes for All Jobseekers in Workforce Innovation and Opportunity Act Adult and Dislocated Worker Programs, the U.S. Department of Labor Employment and Training Administration (ETA) encourages workforce development agencies to create partnerships with existing community groups that work in underserved communities, as they are well known by the community and are valuable sources of feedback for the workforce system. ETA encourages workforce agencies to invite these organizations to share concerns to ensure services are responsive to community needs, and to budget funds to compensate organizations for this expertise. The Urban Institute has developed guidance for building community partnerships.

Workforce practitioners can seek input from jobseekers, compensating them for their time, on whether a program helped them by administering surveys or focus groups. Additionally, listening sessions with community-based organizations can shed light on whether a program was effective in their community and how they define evidence of effectiveness for their community. The Community Toolbox, a compilation of community development tools, guides, and resources hosted by the Center for Community Health and Development at the University of Kansas, has guidance for gathering community input through focus groups and surveys. Racial Equity Tools also offers a number of resources on equitable data collection methods. Worker boards can be a useful source of information on the perceived equity of a given policy. See this case study on worker boards from Harris County, Texas.

Apply evidence to everyone

Ensure “evidence of effectiveness” applies to everyone. Workforce agencies can encourage applicants to highlight service delivery approaches that have been shown to be effective for participants from marginalized communities or who belong to certain demographic groups. Knowing that an approach was effective overall, but was less effective for a high-priority subgroup, or that an evaluation omitted certain subgroups from data disaggregation, will be helpful in fully assessing the evidence behind an approach and guiding where new, more equitable evidence needs to be built. This report from the Joint Center for Political and Economic Studies has more information on the importance of data disaggregation in evaluating workforce development programs.

The Local and Regional Government Alliance on Race & Equity and Municipal Research and Services Center of Washington have a number of equity tools and resources geared toward local governments.

Getting Started

How to Get Started

Determine if your agency has an existing definition, continuum or framework. If not, examine whether such materials have been issued by your state agency.
Review the examples in this guide and determine which approach best reflects the needs of your organization.
Using one of the examples in the guide (or from other government agencies) as a starting point, draft a set of tiers that reflect your organization’s needs. Consider (a) how you will apply your new framework; (b) the degree to which the programs you plan to assess are well established or emerging; (c) the capacity that exists in potential respondents; and (d) the role equity will play in your framework.
Define each tier of the framework. Provide enough descriptive information that a potential grantee, service provider or evaluator can easily understand what is required to meet the tier.
Consider sharing a draft of the framework with trusted partners to gauge understanding and identify potential gaps.
Test the framework against completed interventions, grants or procurements. Consider what information would have been readily available, how the application of the framework would have impacted the process, and what additional information would have been needed.
Finalize your framework, along with any accompanying guidance on how it should be used.
Create opportunities to discuss your framework with potential respondents before it is formally implemented in upcoming procurements. This may include offering presentations to partners, delivering training, or making technical assistance available to respondents pre-submission.

Sample Evidence Frameworks from Federal Agencies