Comprehensive Review of Effective Application of Questionnaires for Clinical Research on Lower Urinary Tract Symptoms With Translation and Cultural Adaptation to the Korean Language
Article information
Abstract
Lower urinary tract symptoms are highly prevalent and closely related to patients’ quality of life. Clinical research on urologic disease is essential for accumulating evidence on patient management; however, the major obstacle is converting patients’ subjective symptoms to objective parameters. The optimal application of well-developed and validated questionnaires is vital in achieving objectivity and minimizing bias in clinical research. Numerous questionnaires for measuring symptoms and quality of life in urologic diseases have been developed worldwide; however, they cannot be directly used in clinical studies without validation processes. This review aimed to explain the common procedures for translation, linguistic, and psychometric validation of developed questionnaires from other languages. Furthermore, we comprehensively reviewed currently available questionnaires for evaluating lower urinary tract symptoms in the Korean population.
INTRODUCTION
Lower urinary tract symptoms (LUTS) are a complex group of symptoms that result from diseases and medical conditions affecting the bladder and urethra. The International Continence Society (ICS) classifies LUTS into 3 categories—storage, voiding, and postmicturition symptoms [1]. LUTS are highly prevalent in the general population, with a prevalence of 64.3%- 76.3% reported in previous studies [2,3]. Severe LUTS are negatively associated with quality of life (QoL) [4]; thus, proper management of LUTS is important for a patient’s welfare.
LUTS are associated with urologic diseases, such as benign prostatic hyperplasia (BPH) and overactive bladder (OAB) [5]. BPH is predominantly characterized by the proliferation of smooth muscle and epithelial cells of the prostate transition zone [6], which is one of the most common urologic problems that cause LUTS in the elderly population [5,7]. OAB is clinically defined by the presence of urinary urgency, mostly cooccurring with urinary frequency and nocturia with or without urge urinary incontinence (UUI) [8,9]. Approximately 10% of worldwide general population have OAB symptoms, and it increased to 20% in 2018 [10].
To evaluate and diagnose LUTS, patients’ medical history, laboratory tests, questionnaires, and function assessment tools, including uroflowmetry, frequency-volume chart, and urodynamic study, were analyzed [11-13]. Owing to the subjective nature of LUTS, the objective measurement of subjective symptoms by patient-reported outcome measurement tools (PROMs) is essential in the management of urologic disease and clinical studies [14]. Therefore, choosing appropriate questionnaires in a clinical trial is extremely important to determine the success or failure of clinical studies [15].
Numerous PROMs have been developed worldwide; however, none of the guidelines showed how to apply optimal PROMs for specific symptoms or diseases. In this review, we have explained previously established procedures for translation and linguistic validation of developed PROMs for other languages and suggested validated PROMs for common urologic diseases.
TRANSLATION AND VALIDATION OF THE QUESTIONNAIRE IN FOREIGN LANGUAGE PIPELINES
In most cases, using a formerly translated and validated questionnaire is a practical method to perform clinical studies and measure patient-reported outcomes (PROs). If there are no optimal questionnaires available in the Korean language, we can develop an in-house questionnaire or translate questionnaires to the Korean language. The development of the novel in-house questionnaire has many obstacles, owing to the difficulty in obtaining the validity, reliability, responsiveness, and interpretability of each statement [16]. Each process requires multiple efforts and time for in-house development using preexisting and already validated questionnaires might be a reasonable alternative. Moreover, using a multiculture, validated questionnaire is beneficial for a multinational, multicenter clinical trial design [15]. Numerous questionnaires have been developed and validated worldwide; thus, translating questionnaires in Korean is more effective than developing in-house questionnaires.
The translated version of the questionnaire does not guarantee equivalent accuracy in collecting patients’ responses to the original version. A well-translated questionnaire provides the same meaning and uses appropriate idioms for each language [17] however, it is not easy. To develop a cross-cultural equivalent of the translated and original version of the questionnaire, linguistic and psychometric validation of the translated questionnaire is essential.
The first process, linguistic validation, has to follow the principles of good practice for translation and cultural adaptation (TCA) process for PROMs, which was announced by the International Society for Pharmacoeconomics and Outcomes Research task force [18]. The TCA framework for PROMs consists of 8 core steps: preparation, forward translation, reconciliation, back translation, harmonization, cognitive debriefing, proofreading, and final reporting. In the preparation phase of the TCA, the project manager should contact the copyright holder for approval of the use and translation of the questionnaire.
Forward translation and back translation must be performed by at least 2 independent translators: a professional translator and a native speaker of the target language and source language (Fig. 1). The reconciliation phase resolves the discrepancy between the forward translators by an independent bilingual, who was not exposed to the forward translation process. The most important part of this process is the reconciliation decision reviewed by the project manager. Back translation is a quality control step performed by at least one backward translator. Harmonization is essential in the TCA process, intensively reviewing and obtaining the equivalent of the source and translated versions of questionnaires by harmonization meetings with the project manager, in-country consultant, and translators.
Cognitive debriefing is the pilot testing of the translated questionnaire to at least 5–8 responders. The project manager assesses face validity, convergent validity, discriminant validity, responsiveness, test-retest, and internal consistency of each respondent. In this step, the project manager assesses that patients exactly understand the meaning of the contexts using responder’s opinion and decide whether to revise the translated questionnaire. After reviewing and proofreading each step, the final version of the translated questionnaire is verified for use in the target country with the same quality of that of the source country questionnaire.
In the second phase of validation, psychometric validation is a more complex process for obtaining validity and reliability of the questionnaire (Fig. 2) [19]. The validity and reliability of the questionnaire need to be verified before application in clinical research. Validity refers to the accuracy, and reliability refers to the consistency of the instrument for evaluating symptom scores. The establishment of validity should be conducted in several ways, including assessing (1) construct, (2) content, (3) criterion, and (4) face validity.
Construct validity represents the relationship between the questionnaire score and the underlying hypothesis [20-22]. Content validity, also known as logical validity, indicates that the items of the questionnaire measure all components that are supposed to be measured [20]. Criterion validity measures the relationship between the novel assessment tool and the current gold standard or another validated tool [22]. Face validity is the subjective view of responders’ symptoms or patients’ QoL, which is also supposed to be measured.
Reliability is another component of the psychometric performance measure of the developed questionnaire. Internal reliability refers to the consistency within the tests, which can be measured using statistical methods of Cronbach’s alpha [20]. External reliability refers to the consistency between the tests, which can be assessed by test-retest reliability and interrater reliability [19,20]. After confirming linguistic validity and psychometric validity, the foreign language questionnaire is fully validated.
OVERVIEW OF THE AVAILABLE QUESTIONNAIRE FOR KOREAN PATIENTS WITH LUTS
From the late 1990s to early 2000, several novel anticholinergic drugs were introduced in the market and multinational, multicenter clinical trials were actively performed. The desire of the Korean Continence Society to participate in clinical trials on patients with OAB has increased. To quantify patients’ subjective LUTS and their QoL for multinational clinical trials, validated Korean versions of international LUTS questionnaires were a prerequisite. However, no questionnaires had been validated by the society, and there was no experience of conducting validation of the questionnaires. In the early 2000s, Dr. SeungJune Oh was interested in validating the Incontinence Quality of Life (I-QOL) questionnaire on a personal level and directly contacted Health Research Associates Inc., Mountlake Terrace, WA, USA to conduct the full linguistic validation process to complete the Korean version of the I-QOL [23]. I-QOL was the first fully validated Korean version of the urinary incontinence questionnaire that was validated using a formal professional validation process. Based on his experience with the full linguistic validation process, Dr. Oh suggested to the Korean Continence Society (KCS) that he would organize taskforce teams to validate the Bristol Female Lower Urinary Tract Symptom (BFLUTS) [24] and King’s Health Questionnaire (KHQ) [25] in Korean. The Korean versions of the KHQ and BFLUTS questionnaires are the first questionnaires that were linguistic and psychological validated by a multicenter study with the involvement of the KCS. Since then, validation studies of many other questionnaires initiated by Dr. Oh have also been conducted at the society or personal level. Consecutively, until the late 2000s, many international questionnaires required for clinical research have been validated in Korean, such as OAB questionnaire (OAB-q) [26], International Consultation on Incontinence Questionnaire (ICIQ), ICIQ-OAB, ICIQ for nocturia (ICIQ-N), ICIQ for male LUTS (ICIQ-MULUTS), Interstitial Cystitis Symptom Index (ICSI), and Interstitial Cystitis Problem Index (ICPI) [27], Urgency Perception Score (UPS) [28], Patient Perception of Bladder Condition (PPBC), Indevus Urgency Severity Scale (IUSS) [28], and OAB symptom score (OABSS) [29]. Since 2010, Korean versions of Patient Perception of Intensity of Urgency Scale (PPIUS), Pelvic Pain and Urinary/Frequency (PUF) [30], Urinary Tract Infection Symptoms Assessment (UTISA), Benefit, Satisfaction, and Willingness to Continue (BSW) [31], Overactive Bladder Satisfaction with Treatment Questionnaire (OAB-SAT-q) [32], and Expanded Prostate Cancer Index Composite for Clinical Practice (EPIC-CP) have been validated. The validated questionnaires obtained through this process played a decisive role in upgrading the level of clinical research on lower urinary tract dysfunction in Korean academic society.
Currently, there are approximately 30 available questionnaires for Korean patients with LUTS and neurogenic bladder (Table 1). The questionnaires focus on (1) objectifying the subjective symptoms, (2) scoring the degree of discomfort, and (3) assessing QoL and satisfaction. Most translated questionnaires are linguistically but not psychometrically validated. All questionnaires were freely accessible on the “Korean Continence Society (KCS)” official website (http://www.kcsoffice.org).
Benign Prostatic Hyperplasia
BPH is one of the most common urologic diseases in elderly men. To analyze symptoms of BPH, the International Prostate Symptom Score (IPSS), American Urological Association Symptom Score, Danish Prostate Symptom Score (DAN-PSS), BPH impact index, and ICS male [20] questionnaires are being used worldwide. There were no questionnaires that completed the validation process for translation. DAN-PSS is the only validated questionnaire but does not cover LUTS, only the sexual function component [33]. Among the BPH symptom questionnaires, only the IPSS has a Korean version. Although the translated version of the IPSS has not yet been confirmed by the linguistic validation process, it is widely used and accepted by Korean urologists. The IPSS has 5 statements for symptoms and one statement for QoL. The symptom statements score the severity from 0 to 5, and the QoL statement scores the severity from 0 to 6. The sum of the total values of IPSS classified symptom severity was mild (0-7), moderate (8-19), and severe (20-35).
Overactive Bladder
OAB is common in both sexes, and the diagnosis of OAB is primarily based on patients’ storage symptoms. Severe OAB causes UUI, which is related to aa reduction in QoL. Several questionnaires have been developed for measuring storage symptoms, incontinence, and QoL, which are important in the diagnosis, treatment, and surveillance of OAB.
The OAB-q was developed to cover all OAB symptoms and QoL problems for both sexes [34]. This questionnaire has 33 statements; thus, obtaining a questionnaire in every clinical practice is extremely time-consuming. To resolve this problem, the OAB-q short form was developed to capture the full statements of OAB-q in a shorter time and with good reliability, validity, and responsiveness [35]. OAB-q is a self-filling type of questionnaire that consists of 2 major domains for measuring the modified OAB—a validated 8-question screen (OAB-V8) and health-related QoL (HRQoL). Each OAB-V8 statement was scored from 0 to 6, and 2 points were added to the male responders. Clinicians diagnose OAB when the sum of the 8 statement scores is >8. The HRQoL domain has 4 subdomains: coping, concern, sleep, and social interaction. The OAB-q questionnaire translated and linguistic validated in over 14 languages [36]. Psychometric property of the Korean version of OAB-q was validated in 2012 [26].
The OABSS is a simplified version of the questionnaire that was developed in 2006. Homma et al. [37] developed this patient-scoring system, which is easy to use. The main concept of OABSS is based on core symptoms of OAB based on the ICS definition [9]—urinary urgency, usually with urinary frequency and nocturia, with or without UUI. Because OABSS has only 4 items, it is extremely easy to apply in a daily clinical practice with acceptable reliability. The severity of OAB symptoms is commonly classified according to the OABSS score —mild (≤5), moderate (6-11), and severe (≥12) [38]. The Korean version of the OABSS was validated both linguistically in 2008 [29] and psychometrically in 2014 [39].
General Symptom Assessment
The BFLUTS [21] questionnaire, which was developed to assess symptom severity, QoL, and treatment outcome, focuses on female LUTS and urinary incontinence. The BFLUTS questionnaire is self-administered by patients. The items of the BFLUTS questionnaire are based on the ICS male questionnaire [20] which was developed to measure the QoL and discomfort in male patients with LUTS. The BFLUTS questionnaire consisted of 4 domains (LUTS severity, sexual function, discomfort, and QoL) and 34 statements. The Korean version of the BFLUTS questionnaire was translated and validated in 2004 [24]. The BFLUTS results show good correlation with the pad test and frequency-volume chart; however, it takes a long time, because of the number of questions. A shorter version of the BFLUTS questionnaire is available: incontinence (BFLUTS-IS), voiding (BFLUTS-VS), filling (BFLUTS-FS), sexual function (BFLUTS-sex), and QoL (BFLUTS-QoL) [40]. There was no information on the reliability of the BFLUTS questionnaire in male patients with OAB.
The KHQ was developed by King’s College Hospital to measure the QoL of female patients with incontinence. KHQ is a self-filling type of questionnaire that covers a broad range of QoL, impact on life, and general patients’ perception of OAB [41]. The full version of the KHQ consisted of 9 domains (general health perception, impact on life, role limitation, physical limitation, social limitation, personal relationship, emotion, sleep/energy, incontinence severity) and 21 items. The KHQ is one of the most widely used questionnaires translated in various languages. The linguistic validation of the Korean version of the KHQ was performed in 2005 [25] and the reliability of KHQ for OAB symptoms was validated for both sexes [42].
Urgency
Urgency is the core symptom of OAB, and the severity of urgency is directly associated with QoL [43]. Several questionnaires have been translated and linguistically validated in Korean.
The IUSS questionnaire focuses on urgency [44]. To reduce recall bias, IUSS measures patient-reported subjective symptoms by 4 degrees of severity: (1) no sensation of urgency, (2) awareness of urgency but easily tolerated, (3) urgency that is somewhat uncomfortable, and (4) extreme urgency with discomfort. The IUSS questionnaire shows reliable results when applied in randomized clinical trials to measure improvement in urgency severity in Korean patients [45].
The PPIUS is a single-item questionnaire focused on urgency symptoms and well-demonstrated symptom severity covering urgency and UUI [46]. The linguistic and psychometric validation of the Korean version was conducted.
The UPS is another single-item questionnaire to determine patients’ perception of urgency and desire to urinate, which was developed in 2005 [47]. It was translated by the KCS and Pfizer Inc., and both Korean versions of the questionnaire were provided on the KCS website.
QoL Assessment
There are several Korean versions of the questionnaires available for QoL assessment in patients with OAB. I-QOL and the ICIQ are mainly focused on QoL assessment. The KHQ and OAB-q questionnaire has QoL domains that were validated for OAB in both sexes. The BFLUTS questionnaire also has a QoL domain and was validated only for female patients with OAB.
I-QOL is a self-reporting questionnaire for patients with incontinence, which was developed in 1996 [48]. The I-QOL items consisted of 3 subscales (avoidance and limiting behavior, psychosocial impacts, and social embarrassment) and 22 questions. I-QOL was translated and validated in 15 countries [49] and the Korean version of the I-QOL was linguistically validated in 2002 [23]. Psychometric property of the Korean version of the I-QOL was validated in 2009 [50]. I-QOL can be used to assess OAB symptoms in both sexes [51].
ICIQ is another self-completion questionnaire used to assess urinary incontinence and its impact on QoL [52]. ICIQ provides several different modules to measure specific symptoms of LUTS: ICIQ-OAB for OAB, ICIQ-N for nocturia, and ICIQ-MULUTS for male LUTS. The Korean version of the ICIQ was validated in 2006.
Other PROMs
The Patent Perception of Bladder Condition Measure [53] was developed to measure urinary incontinence. The European Medicine Evaluation Association recommends using the Patient Perception of Bladder questionnaire to evaluate OAB symptoms. The PPBC questionnaire shows reliable results in using OAB symptom assessment in 2 clinical trials [53]. The linguistic and psychometric validation of the Korean version of the PPBC was performed, and the available form is available on the KCS website.
The BSW questionnaire is focused on measuring PROs after OAB treatment. The BSW questionnaire was developed by Pfizer Corporation and consisted of 3 domains of PROs: treatment benefit, satisfaction with treatment, and willingness to continue treatment [54]. The BSW questionnaire successfully demonstrated the global impression of the net benefits of treatment outcomes for both clinical research and daily practice. The statements of the BSW reflect patients’ attitudes and satisfaction with treatment. This questionnaire also shows the patient’s compliance with medication [31]. The Korean version of the BSW is available with linguistic validation [31].
The Treatment Satisfaction Visual Analog Scale (TS-VAS) [32] and OAB-SAT-q [54] measure patients’ satisfaction with their treatment on OAB. The TS-VAS uses a 100-mm-long horizontal line, and a patient makes a vertical mark on the line to convey their satisfaction of the treatment [55]. OAB-SAT-q contained 11 questions across 6 domains: effectiveness, convenience, side effects, preference of treatment, willingness to undergo treatment again, and global satisfaction. Both the TV-VAS and OAB-SAT-q were available for linguistic validation [32].
Incontinence
Urinary incontinence is an involuntary loss of urine that is classified by stress urinary incontinence (SUI), UUI, mixed urinary incontinence, nocturnal enuresis, and postural, continuous, insensible, and coital incontinence. Urgency can be combined with UUI; thus, most OAB questionnaires (KHQ, I-QOL, BFLUTS, ICIQ, and OABSS) contained incontinence domains. Both SUI and OAB are common in older females; thus, they can coexist. Several LUTS questionnaires have been validated for application in SUI. The KHQ obtained psychometric validation for application [56] and the ICIQ-UI short form is the most widely used questionnaire for patients with SUI [57].
Adult Neurogenic Lower Urinary Tract Dysfunction
Adult neurogenic lower urinary tract dysfunction (ANLUTD) is an abnormal or difficult lower urinary tract function in the context of clinically confirmed neurogenic disorders [58]. Patients who need clean intermittent catheterization (CIC) for bladder emptying in ANLUTD is up to 56% in spinal cord injury and 75% with multiple sclerosis [59]. To assess PROMs of patients who perform CIC, the intermittent self-catheterization questionnaire is available in the Korean version with linguistic validation [60]. The English version of the Qualiveen questionnaire, originally developed to measure the specific impact of urinary problems on the QoL of patients with neurogenic bladder, was linguistically validated in Korean in 2010 [61].
Although patients’ QoL was severely altered in ANLUTD, there were no standard PROMs. Recently, the neurogenic bladder symptom score was developed and validated in the English version [62]. This questionnaire targeted the measurement of discomfort and QoL of ANLUTD, which can be a potential candidate for standard clinical use in patients with ANLUTD.
ISSUES IN QUESTIONNAIRE INTERPRETATION AND ANALYSIS
There were several issues regarding the interpretation of the questionnaire results. First, most questionnaires used a Likert scale, which is one of the most widely used psychometric scales to measure the responders’ attitude by several degrees of agreement with a statement. The Likert scale has an innate risk of bias that the researcher should be cautious of before analysis. If the variables show biased high estimates, patients tend to answer only low scores only (floor effect). In contrast, if the variables show biased low estimates, patients tend to answer high scores only (ceiling effect). Despite the changes in the independent variables, the changes in the dependent variable are not measurable, considering the ceiling or floor effect. In this situation, it is difficult to analyze the distribution of each degree because the score distribution is too low or too high. Therefore, the investigators must consider consistency and reliability before and after the tests for control and experimental groups and modify the statement for analysis.
Second, previous studies have shown that symptom scores and QoL do not directly reflect the severity of disease status. For example, BPH symptom scores do not accurately reflect the severity of LUTS, and several studies comparing questionnaire scores and urodynamic findings showed low correlations with urination muscle activity and bladder outlet obstruction [63-65].
Future Perspectives
There are still numerous questionnaires to be translated and validated. For example, detrusor underactivity is a complex of symptoms and bladder emptying problems related to urodynamic study-based abnormalities [66]. The prevalence of detrusor underactivity is reported in up to 40% of males and increases with age [68,68]. Although the importance of detrusor underactivity has been recently highlighted, no validated PROMs have been widely used [69,70]. Developing or translating the PROMs for sexual dysfunction in females is another issue. Severe LUTS can be correlated with sexual dysfunction in both males [71] and females [72]; however, there is still no standard questionnaire that directly measures female LUTS-related sexual function. The questionnaire focused on LUTS-related vaginal symptoms, and ANLUTD also needs to be validated.
CONCLUSION
In this review, we have summarized the established validation processes of the foreign language questionnaire before clinical use and explained the currently available questionnaires in Korean for LUTS. It is important to clearly determine the characteristics and limitations of each LUTS questionnaire to select the appropriate questionnaire for the clinical trial.
Notes
Conflict of Interest
No potential conflict of interest relevant to this article was reported.
AUTHOR CONTRIBUTION STATEMENT
·Conceptualization: SYC
·Data curation: JS
·Formal analysis: SJO
·Project administration: SYC
·Writing-original draft: JS
·Writing-review & editing: SJO