Figure 1.9 Comparison of 3 lipsticks demonstrating high formulation transfer (A) and low formulation transfer (C). Each panelist kissed the white paper four consecutive times without reapplication, as seen through prints 1-4 [58].
1.4 Relation of In Vitro Analysis with Consumer Sensory Testing
Sensory tests are commonly used in a wide variety of fields, especially in the cosmetics field, in order to gain a better understanding as to the product’s performance and perception made by the consumer. In doing so, the product under development can be further improved and optimized whilst being compared to similar competitor products. Sensory testing can be conducted in three different methods: discriminative, affective, or descriptive [58]. Discriminative testing is typically used to screen for a detectable difference between two products, whether of the same brand or that of a competitor. Affective testing determines the consumer’s acceptance or preference towards a particular product. Descriptive testing provides more detail in regards to characteristics of a product. While each of these methods provides useful information to formulators, each is restricted in terms of information provided or ease of analysis.
Currently, there is limited information available relating data obtained from the in vitro characterization of lipsticks directly to that observed during consumer sensory tests. Typically, products are evaluated and then passed on for consumer testing. These tests aim to target specific characteristics of the formulation which may correlate with quantitative measurements made in a laboratory setting. While this is useful for relating the quantitative characteristics described by consumers, it does not quite indicate if the product will perform well in terms of the consumer actually liking the product. Consumer sensory tests are also restricted in the sense that only three to four formulations can be tested at a time due to the limited availability of panelists, time restrictions, and cost; therefore, it is imperative that the best rated lipsticks in laboratory screenings are selected.
The sensory evaluation is typically conducted in a controlled environment, where the temperature, humidity, and lighting can be regulated. Panels may be untrained or trained, depending on the scope of the study being carried out. A reliable sensory panel is necessary when investigating in order to allow for study reproducibility and consistency. Typically, this can be achieved by establishing an agreement between panelists regarding terms, definitions, and characteristic descriptions prior to product use; this allows for members of the panel to be consistent with their grading and ensures that each panel is analyzing fairly. A new approach for enhancing panel reliability has been described by Latreille et al.; this utilizes a mixed linear model which tests the panel’s reliability through various criteria [62]. This model measures both an individual’s reliability in addition to the group as a whole by looking at discrimination ability, repeatability, and agreement/disagreement using contrasts [62]. Though this method provides an accurate representation as to the panel’s reliability and reproducibility, it is restricted in the sense that it only works well for small panel groups with few products to evaluate; this is because as more products are added, results become much more complex [62]. Larger panels with more products are recommended to use other means to evaluate panel performance in relation to consistency of descriptors between products [62]. In short, it is imperative to be able to obtain repeatable and dependable results from such in vitro studies, as these help shape the evolution of a product’s formulation. These results also help scientists in translating findings from in vitro tests to product performance in panels, thus making it even more imperative that the findings are accurate.
In order to properly assign scores to lipstick products, studies typically include multiple sessions that help users precisely define key characteristics and develop sensory language to prevent any misinterpretations. For example, a study conducted by Yap and Aminah conducted five separate sessions with the panelists; these are shown in Table 1.2 [63].
These panels are typically controlled in a few different aspects, the first being the environment in which the test is being conducted. Secondly, the desired key outcomes or findings are used to shape panelist questions and discussion points. For example, a panel of 50 lipstick users was used to compare transfer-resistant lipstick formulations with that of the user’s regular lipstick through two different tests [64]. The first test evaluated the transfer of the lipstick 30 minutes after application; panelists patted their lips with a white cloth and were asked if an acceptable amount was removed from the process, where 98% said yes [64]. Panelists were also asked if the amount was significant (to which 94% stated insignificant) and asked to compare to regular lipsticks (to which 92% stated the amount was less) [64]. The second test followed the same suit as the prior one, but after the 30-minute drying period, panelists were asked to drink water. 100% stated an acceptable amount stayed on the lips, where 98% said the amount removed was insignificant and 96% claimed the amount removed was less than regular lipstick [64].
Table 1.2 Summary of session objectives in sensory panels [63].
Session | Objective |
Session 1 | Development of a list of characteristics based on personal perception of the product. |
Session 2 | Defining key terms that translate to sensory attributes. |
Session 3 | Development and use of scoring system for formulations. |
Session 4 | Repeat session 3 for consistency. |
Session 5 |
A study conducted by Rafferty et al. investigated the ability to predict lipstick sensory products through the use of in vitro testing. Here, 12 different lipstick formulations were studied in regards to sensory perceptions (spreadability, stickiness, opacity, and gloss) with a panel of nine users and in regards to in vitro performance parameters (friction test, application test, tack test, and color development retention test) [65]. From statistical modelling, it was found that the in vitro analysis correlated well with findings from the sensory panel; more specifically, results from the friction test alone were relatable to sensory panel findings in regards to spreadability, stickiness, opacity, and gloss [65]. This is because the friction tests were able to separate the products as creamy, waxy, oily, or sticky from which then explained the adhesion and cohesive properties of each lipstick sample [65].
Ultimately, there is a tradeoff that needs to be made by consumers in regards to achieving the ‘perfect’ long-wear lipstick that is durable and comfortable. Most long-wear lipsticks with claims of 12+ hours of wear tend to be a two-step application process which may be inconvenient for a particular customer base who is always on the go. If a consumer prefers for the product to rank higher in terms of comfort, then the lastingness of the product may suffer. This is because long-wear lipsticks that rank high in comfort typically have a gloss component that introduces moisture to the lip, reducing the tightening and dry sensation experienced by the wearer. These glosses, however, are more likely to be transferred from the lip. Products with better durability tend to be matte films which may feel dry to the consumer.
1.5 Summary
Although an abundance of methods may be used to help characterize long-wear and transfer-resistant lipsticks,