How To Create A Personality Test

Learn how to create any kind of personality inventory or cultural fit assessment from very basic to highly complex.

Personality tests are widely used far beyond psychology; they play an important role in hiring, leadership development, and workplace decision-making. A survey of 84 large organizations (with an average annual revenue of 45.7 billion dollars) revealed that:

  • 66% use personality testing for “high potential” employees
    • 57% use it for senior executives

      Today, personality testing is the second most common pre-employment assessment practice, and 74% of US-based HR professionals believe that it provides value. With demand rising, the ability to create a reliable personality test is now a crucial competitive advantage for businesses.

      Key takeaways

      • Personality tests measure personality traits through structured self-report questionnaires, typically based on rating scales.
        • Reliable personality testing requires careful trait mapping, scoring algorithms, norm groups, and benchmark comparisons.
          • TestInvite enables organizations to create personality tests with custom scoring, automated trait mapping, tailored reports, and benchmarking tools.
            • A valid and reliable personality test must ensure consistency (reliability) and accurate trait measurement (validity) to produce meaningful results.
              • Personality assessments support data-driven talent and culture decisions, helping organizations understand how individual traits align with roles or teams.

                What is a Personality Test?

                A personality test is a method of assessing human personality constructs (aka personality trait dimensions, facets, factors, or aspects).

                Personality assessment instruments are introspective self-report questionnaires that consist of rating scales (a series of multiple-choice questions). That is why personality inventories are classified as Subjective Tests. The output depends on the examination of one's own conscious thoughts and feelings, which is a subject’s self-evaluation.

                History of Personality Testing

                The first personality assessment measures were developed in the 1920s and were intended to ease the process of personnel selection (aka. Hiring, Recruitment), particularly in the armed forces. Since these early efforts, a wide variety of personality scales and questionnaires have been developed.

                How can you create a personality test with TestInvite?

                1. Add Effects & Map Traits to Questions

                The Multiple Choice Question Editor allows you to map each option to one or many traits (aka. effects). That allows to track & report automatically each option.

                For example in the screenshot below, you will observe that the options of the questions are mapped to Agreeableness/Politeness trait with a different scoring each.

                Personality Test Question Example
                Personality Test Question Example
                • If Very Inaccurate is selected, Agreeableness/Politeness will be increased by 1
                  • If Moderately Inaccurate is selected, Agreeableness/Politeness will be increased by 2
                    • If Neither Accurate nor Inaccurate is selected, Agreeableness/Politeness will be increased by 3
                      • If Moderately Accurate is selected, Agreeableness/Politeness will be increased by 4
                        • If Very Accurate is selected, Agreeableness/Politeness will be increased by 5

                          2. Create Custom Reporting Template

                          Each personality test is different. Hence the reporting. That's why Testinvite allows you to have your own reporting template. The template is a simple HTML template with placeholders.

                          Custom Reporting Templates for Personality Tests
                          Custom Reporting Templates for Personality Tests

                          3. Create Norm Groups & Benchmarks & Match Scoring Algorithm

                          You can create your own function (algorithm) to create norm groups, benchmarks and match scores.

                          Custom Functions for Personality Tests
                          Custom Functions for Personality Tests
                          Reporting with Norm Groups & Benchmarks
                          Reporting with Norm Groups & Benchmarks

                          How are personality tests scored?

                          A personality test is a self-report inventory that requires respondents to introspectively assess their own personality characteristics by indicating their level of agreement with each item. An item on a personality test, for example, might ask respondents to rate the degree to which they agree with the statement “Don’t talk a lot” on a scale from 1 (“strongly disagree”) to 5 (“strongly agree”).

                          Since these items don’t have right or wrong answers, their value comes from how responses are scored and mapped to personality traits.

                          1. Scoring of Personality Tests

                          A personality test is a self-report inventory which requires respondents to introspectively assess their own personality characteristics by indicating their level of agreement with each item. An item on a personality test, for example, might ask respondents to rate the degree to which they agree with the statement "Don't talk a lot" on a scale from 1 ("strongly disagree") to 5 ("strongly agree")

                          Example Personality Test Question
                          Example Personality Test Question

                          Unlike other kind of assessments (such as aptitude tests or language tests) questions in Personality Tests do not have a correct / incorrect answer. However;

                          • Each option in the scale is mapped to a Trait (or a series of Traits) by a different scoring. For instance the option "Strongly Agree" may be assigned to the Extraversion Trait by a score of 1 (or 5 if the keys are reversed).
                            • Traits may be structured (aka. grouped by) in a tree-like form so that when an option that has been assigned to a trait has been selected by the respondent; both the trait and the parent trait(s) should be affected by the score. For instance in the IPIP NEO PIR Test, the traits Activity Level, Assertiveness, Cheerfulness, Excitement-Seeking, Friendliness and Gregariousness are grouped by the Extraversion Trait.
                              IPIP NEO PIR Test Traits Hierarchy
                              IPIP NEO PIR Test Traits Hierarchy

                              So when the respondent answers a question, the scores of the traits that are assigned to the selected option are affected. When the respondent answers all the questions in the personality test, all traits will have their own scores calculated.

                              Personality Test - Scores for Each Trait & Parent Trait
                              Personality Test - Scores for Each Trait & Parent Trait

                              2. Using Norm Groups to Derive Results

                              Even thought personality tests are scored by scoring respondent's answers for each trait, the scores for each trait do not say much about the respondent personality by itself. (Personality test scores are difficult to interpret in a direct sense). That's why the scores for each trait must be compared with the rest of the population who have already completed the personality test. This is where the norm group comes into play. Norms provide a comparative basis for interpreting a respondent's test scores.

                              Common formats for these norms include percentile ranks, z scores, sten scores, and other forms of standardised scores. By using a norm group, the test administrator can gage how your traits (for example Extraversion) compares to the rest of the population.

                              How to interpret individual scores?

                              To interpret individuals' scores, one might calculate the mean and standard deviation (SD) for a sample of persons, usually of the same sex and a particular age range, and interpret

                              • scores within one-half SD of the mean as "average."
                                • scores outside that range can be interpreted as "low" or "high."

                                  If the scores are normally distributed, this would result in approximately

                                  • 38% of persons being classified as average
                                    • about 31% as low
                                      • and 31% as high

                                        An alternative method for showing respondents where they stand with respect to a group of respondents is to divide a set of scores into five equal parts, which are called quintiles. Labels for the scale anchors describe the lowest 20% and highest 20%, the label "average" is used for the middle 20%, and the remaining quintiles are labeled "somewhat." For example, labels for the quintiles on a scale that ranges from introversion to extraversion would be introverted, somewhat introverted, average, somewhat extraverted, and extraverted.

                                        3. Profile Matching & Benchmarks

                                        Norm Groups are used to derive meaningful results given a set of trait scores. However when using personality tests or cultural fit assessments for any kind of selection process (employee selection, school selection, career selection etc..), using Norm Groups are not enough. Because it is not enough to know about the personality of the respondent, but also will you need to find out how much the personality fits into the position/job or organization's culture.

                                        In order to derive such conclusions, benchmarks - for examples job specific benchmarks - that assess how good a fit a person's personality is for a given position should be created. The benchmarks calculate a person's job match score by comparing their personality trait scores to composite profiles created

                                        4. Score Checks & Correction for Faking

                                        To derive dependable results, there will usually be some kind of ‘score check’ involved, so that the test administrator can see things such as how consistently the respondent answered the questions or whether he/she just went for the neutral option on most of them.

                                        What makes a personality test reliable and valid?

                                        A personality test is considered reliable when it produces consistent results over time and across different situations. Reliability ensures that if a person retakes the test under similar conditions, their scores remain stable, and the results are not influenced by random errors or temporary factors. This consistency can be evaluated through methods like test–retest reliability, internal consistency of items, and agreement between different scorers when human judgment is involved. Without reliability, it becomes unclear whether the test is measuring stable traits or just random fluctuations.

                                        A test is considered valid when it accurately measures what it claims to measure. For personality assessments, this means the test must truly capture underlying traits, like extroversion or conscientiousness, rather than unrelated characteristics. Validity can be demonstrated in several ways, such as content validity (covering all aspects of a trait), construct validity (aligning with psychological theory), and criterion validity (predicting real-world outcomes, like job performance).

                                        What to look for in software for personality testing?

                                        If you want to create and deliver personality tests online, you will need an online assessment software that is capable of

                                        • Creating a Trait Hierarchy
                                          • Creating Multiple Choice Questions having each option assigned (aka. mapped) to one or multiple traits.
                                            • Creating custom reports for each test
                                              • Creating custom matching algorithms (aka. functions) for benchmarking that returns match scores.
                                                • Allowing you to create a customizable algorithm (aka. function) for each test, to handle the correction for faking.

                                                  Widely Used Personality Inventories

                                                  Most of the instruments used today, such as:

                                                  • Minnesota Multiphasic Personality Inventory (MMPI)
                                                    • The Sixteen Personality Factor Questionnaire (16PF)
                                                      • The Comrey Personality Scales
                                                        • Revised NEO Personality Inventory (NEO PIR)

                                                          These are based on the Big Five and Five Factor Model of personality.

                                                          Market Size and Use Cases

                                                          Estimates of how much the personality assessment industry in the US is worth range anywhere from $2 and $4 billion a year (as of 2013). Personality assessment is used in a wide range of contexts, including:

                                                          • Individual and relationship counseling
                                                            • Clinical psychology
                                                              • Forensic psychology
                                                                • School psychology
                                                                  • Career counseling
                                                                    • Employment testing
                                                                      • Occupational health and safety
                                                                        • Customer relationship management

                                                                          Popularity and Limitations

                                                                          Today, questionnaires have gained huge popularity across the globe in the area of personality assessment because of its ease of construction and administration, despite its serious limitations such as social desirable responses, judgmental errors, lack of adequate self-insight (or biased perceptions of others) and faking. (Faking is defined as deliberate provision of “inaccurate responses to personality items in a manner that they believe will increase their chances of obtaining valued outcomes, such as a favorable hiring decision”).

                                                                          Created on 2023/04/18 Updated on 2025/10/15 Share
                                                                          Mustafa Ekim, founder of TestInvite and QuizCV, brings nearly a decade of expertise in building online assessment platforms for custom, secure tests.
                                                                          Go Back

                                                                          Talk to a representative

                                                                          Discover how TestInvite can support your organization’s assessment goals