Getting granular in your thinking can further refine your evaluation of what constitutes a fit for purpose data set.
With a well-defined question(s) in hand, you can begin to zero in on specific outcomes you wish to target. This approach is critical for understanding the fit for purpose nature of a data set. As with the example in step 2, you should avoid overgeneralization and attempt to get as precise as possible in identifying the success measure(s) to be tested.
A lack of specificity often leads to false starts and the potential for lost time and money. One common method that researchers turn to when they skip this step is using the feasibility process to determine if a data asset is research ready.
Multiple question feasibilities do not constitute an effective evaluation of fit for purpose data. This approach is less efficient and prone to inconclusive assessments of such data — being proactive ahead of feasibility assessment is best.
Following the example of bariatric surgery, a list of more specific criteria may look like the following:
- Define what bariatric surgery is. For example, ICD-10 or CPT® codes may be used.
- Ensure that literature scans and/or clinical expertise is engaged to work through these definitions and success metrics to ensure they are accurate.
- Establish how the rate of success will be measured. For example, a change in BMI (specific thresholds may be defined) from pre-surgery to a specific time post-surgery would be a suitable starting point.
- Think through what defines an unsuccessful outcome. In this case, a minimal to no change may be appropriate (again, specific thresholds may be defined).
- Identify any relevant time frames for measurement. For example, one year and five years post-intervention may be isolated.
- Identify any important hard outcomes. In this case, cost outcomes may be desirable, including what the patient and health plan ultimately paid for surgery.
With this example, the research question can now be further refined, as follows:
- What is the success rate of bariatric surgery, as measured by a (potentially specifically defined) reduction in BMI at one year and five years post-surgery, and what percentage of patients do not see success (zero change in BMI)? What are the differences in cost — at both the patient and plan level — for successful versus unsuccessful bariatric surgery patients?
Based on this question, you can see there are 5 key metrics (BMI, time, procedure codes, plan and patient amount paid) that must be resident in or calculable from a data set for it to be considered fit for purpose for this study.
You can also list out any desirable supporting data elements to evaluate, if available. Consider how important those are to the overall analysis. Examples may include supporting data such as demographics, social determinants of health (SDOH) information or other factors that may be used as control variables or to profile populations.