MOD: Textio

An AI-powered writing assistant tool that enhances job adverts by analysing and optimising language for inclusivity, engagement and effectiveness.

Tier 1 Information

Name

Textio

Description

Textio is an AI-powered writing assistant tool that enhances job adverts by analysing and optimising language for inclusivity, engagement and effectiveness. It provides real-time feedback and predictive analytics, suggesting alternative phrases, to eliminate bias and improve readability. This tool was introduced to improve the quality of job adverts, minimising issues such as unexplained acronyms and long and difficult to understand content blocks. Along with other complementary recruitment process improvements, Textio ensures that our adverts provide content on career opportunities, that we are clear on the skills we are looking for, and effectively promote the range of opportunities available in Defence.

Website URL

https://www.textio.com/

Contact email

[email protected]

Tier 2 - Owner and Responsibility

1.1 - Organisation or department

Ministry of Defence, Defence People Team

1.2 - Team

Defence People Civ HR- Workforce Planning and Resourcing

1.3 - Senior responsible owner

Deputy Director, Workforce planning and resourcing

1.4 - External supplier involvement

Yes

1.4.1 - External supplier

Textio, Amazon AWS, Amazon Guard duty

1.4.2 - Companies House Number

US company, not registered on companies house

1.4.3 - External supplier role

Textio is responsible for the engineering of its tool by designing, developing and maintaining the software and algorithms that power its language optimisation features. Amazon AWS and Amazon Guard Duty are considered as part of Textio’s infrastructure. AWS provides the cloud computing resources necessary for processing large datasets and running machine learning models. Amazon Guard Duty offers security monitoring to protect data and ensure compliance with security standards.

1.4.4 - Procurement procedure type

Head Office Corporate Services (HOCS) commercial outlined using the Crown Commercial Framework to deliver this requirement, this was the preferred option. However, following review there was not a suitable framework in place at the time which fully met the requirement. The requirement was openly tendered enabling CIV HR to assess and select the most economically advantageous supplier against the statement of requirement, this was unsuccessful and no contract was awarded. Textio was the only remaining option, all avenues were explored. As it is actively used by other government departments, Textio was then awarded a direct contract which was processed by HOCS commercial to ensure that MOD terms and conditions are adhered to.

1.4.5 - Data access terms

MOD grants Textio a non-exclusive, worldwide, royalty-free license to use, reproduce, modify and make derivative works based upon the Data solely in connection with use of the Services and Textio’s provision of the Services to Company. MOD represents and warrants that MOD or MOD’s licensors own all right, title and interest in and to the Data, and that Company has all rights in DocuSign Envelope ID: 9D0C21A8-C63C-4A46-BBFB-AAD2A3C936AD- 4. Data that are necessary and sufficient to use the in connection with MOD’s Account on the Services.

Tier 2 - Description and Rationale

2.1 - Detailed description

Textio is a writing platform that optimises job postings, emails, and other recruiting materials to draw in diverse applicants with the skills we need in Defence. It analyses language patterns using machine learning and offers ideas in real-time to improve the efficacy of job advertising. Textio uses Artificial Intelligence and continuously learns from the content uploaded onto the platform.

2.2 - Scope

Textio is a web accessed online platform which assesses how effective your job advert is and suggests improvements. It is designed to improve the number, quality and diversity of candidates to help get the best person in post, first time around. It also has access to the MOD’s online advert library, and can save time by choosing to adapt an existing advert. Textio was introduced as a tool for MOD Main only (i.e. excluding arm’s-length bodies) due to the different grading, and job roles in these parts of the organisation. It is also designed for Civilian roles only.

2.3 - Benefit

The tool was introduced to improve the inclusiveness and effectiveness of adverts, enhancing the Department’s ability to attract and retain a more diverse workforce. The tool also helps us to simplify language in job adverts, improve understanding of what we are looking for in our organisation and this is one part of the jigsaw of continuous improvement to our ability to attract and retain staff.

2.4 - Previous process

Prior to the introduction of Textio, recruiting managers were solely responsible for the creation of the job adverts, this process was supported by internal guidance provided by the CIV HR Recruitment policy team and quality assurance was administered by Defence Business Services (DBS). Introduction of this tool is part of a suite of activity to improve our ability to attract and retain staff.

2.5 - Alternatives considered

Success in recruitment activity is down to a number of factors throughout the recruitment process, from our social media presence, job adverts and the end-to-end recruitment process, therefore more than one intervention is required to optimise our ability to attract and retain the right people with the right skills at the right time. Other initiatives were implemented to strengthen the quality of job adverts, improve applicant engagement and diversity of successful candidates. These include:

  • Development of the Recruitment & Resourcing Toolkits to guide managers on the end-to-end process including attraction
  • Development of additional guidance e.g. How To Write A Better Job Advert, How To Assess Using CVs and Recruiting Line Manager checklist
  • Policy bites (video content hosted on our internal intranet (Defnet)) “How To Write A Better Job Advert and Recruitment”
  • Implementation of the new CS Jobs system, VX (offering a simpler and user-friendly system) and DBS How to guides on using VX (e.g. creating an advert)
  • LinkedIn advertising for external roles, enabling adverts us to reach a wider audience
  • Removal of Minimum Entry Qualifications (no longer stipulating certain qualifications depending on grades across all adverts)
  • Monthly Network working group sessions (comprising of reps across MOD) to share best practice when recruiting and filling vacancies

Tier 2 - Decision making Process

3.1 - Process integration

Recruiting managers copy text from an advert created on the CS jobs platform and populate it within Textio. By providing managers with this tool to optimise job adverts for language inclusivity and effectiveness, it ensures that we are able to reach a broad range of potential applicants. Textio helps managers to craft engaging, unbiased adverts, aligning with recruitment efforts and MOD’s diversity and inclusion goal whilst enhancing overall candidate engagement.

3.2 - Provided information

Textio provides decision makers with detailed insights, offering overall text quality scores, tone and inclusivity analysis and word and phrase suggestions based on content uploaded onto the software by the employee. It also includes data-driven insights with performance metrics, predictive analytics, context-specific recommendations tailored to industries and audiences, real time feedback, comparative analysis through benchmarking and historical data insights helping the user to refine their writing.

3.3 - Frequency and scale of usage

Since the start of using the platform, 2556 internal users (recruiting line managers) have registered to use Textio. The quantity of data being processed will also depend on the volume of content and the frequency of usage. The software is mandated for use for all recruiting line managers, as awareness grows across the organisation in regards to the tool usage will increase.

3.4 - Human decisions and review

Textio utilises artificial intelligence and predictive analysis to analyse language patterns and provide real-time feedback. The user is then given a score out of 100 which determines readability and the level of inclusive language, a score between 80-100 is deemed to be more effective in terms of potential candidate engagement, users have the ability to add or subtract words to improve the scoring. They also have the ability to override suggestions that may not align with the MOD brand. The Employee/User makes the final decision regarding the content of the job advert.

3.5 - Required training

Textio: Employees are trained in data protection principles, data handling, data regulation and best practice. MOD employees: Intermittent training is provided to staff to ensure best practice and to enhance knowledge around the software.
MOD account manager for Textio: Cybersecurity awareness & training procedures for privileged users.

3.6 - Appeals and review

N/A

Tier 2 - Tool Specification

4.1.1 - System architecture

Textio’s architecture combines data ingestion from various sources, Natural Language Processing (NLP) to analyse job advert content to predict advert effectiveness. Providing real time feedback through a user-friendly interface, it leverages cloud infrastructure (AWS) for scalability and data processing. Security and compliance measures are integrated to protect user data and ensure it is a robust and reliable tool for optimising job postings.

4.1.2 - Phase

Beta/Pilot

4.1.3 - Maintenance

Textio employs several security measures to safeguard user data, including the encryption for secure transmission and storage, strict access controls limiting system, and database access to authorised personnel. Textio conduct regular security audits to identify and address vulnerabilities and also it has established incident response protocol for swift detection and mitigation of security incidents. Textio ensures that it complies with data protection regulation (GDPR) and DEFCON 532b.

4.1.4 - Models

  • Rule-based model: identifies and flags biased and non-inclusive language.
  • Machine learning: uses extensive datasets to predict the effectiveness of different language choices and its impact on candidate engagement.
  • Statistical: analyses trends and patterns in job postings providing benchmarking against industry norms.
  • Natural language processing (NLP): analyses and understands text to offer real-time suggestions for improvement.

Tier 2 - Model Specification

4.2.1 - Model name

Textio uses a wide range of AI techniques and tools, including regularized linear and logistic regression, topic modelling (LDA), conditional random fields (CRFs), convolutional neural networks (CNNs), GLoVe, ELMo, BERT, DistilBERT, word2vec, and tok2vec. Textio uses prompt engineering over GPT-3.5 for generative AI features and an internal, Textio-proprietary scoring system to post-filter generated text.

4.2.2 - Model version

For AI models, GPT-3.5 turbo is the version. Since Textio’s product is comprised of many different ML models, these are kept up-to-date to the latest versions.

4.2.3 - Model task

The AI model designed by Textio is focused on assisting users in writing various documents like job postings and recruiting emails.

4.2.4 - Model input

The AI model’s input for Textio includes various types of data sources used for training and operation, such as job postings, recruiting emails, public sources, survey responses, and expert linguistic annotations. The input text from writers is masked to exclude personal information, and the training data is segmented to focus on relevant text data collected through the Textio Data Exchange Program. Customers must explicitly opt into the Textio’s Data Exchange program.

4.2.5 - Model output

AI-Generated Text: Scored internally for quality, with low-quality output flagged and corrected. Model Predictions: Combined to provide user-understandable scores and guidance. Continuous Retraining: Models are continuously retrained with new data to maintain effectiveness.

4.2.6 - Model architecture

Textio uses a mix of Natural Language Processing (NLP), Machine Learning (ML), and heuristic approaches in its AI models. The AI model also utilizes deep learning techniques such as ELMo and BERT. Textio models are composed of multiple layers of sub-models to enable introspection and user-facing guidance. All documents are featurised using an NLP data pipeline. Heuristic models layer reasoning over these NLP features. Models that use ML are trained against proprietary datasets, where models learn to predict effectiveness of writing based on observed outcomes.

A vast majority of these are traditional ML algorithms such as SVMs, Random Forest, LDA-based topic modelling, and regression models. These include supervised (19) and unsupervised (1) traditional models. We have 1 transformer-based LLM/AI model based on GPT-3.5 turbo.

4.2.7 - Model performance

To ensure the models are accurate and performant for a given task, Textio employs linguist analysts to hand-label new data to test our ML features against on a regular basis. We also have continuous monitoring of our services, including ongoing, regular unit testing and automatic evaluations of our models to check for model decay or drift.

Textio’s team tests the predictions of every new model version against a held-out set of the most recent known hiring and survey outcomes, sampled from across representative industries and data sources. To ensure that custom-built models are accurate and performant for a given task, Textio’s team tests the predictions of every new model version against a held-out set of the most recent known hiring and performance feedback outcomes. Textio does not share its results.

4.2.8 - Datasets

The data sets used to develop the AI model at Textio include: - Job postings with time to fill, applicant numbers, and demographics - Recruiting emails and responses - Data from public sources - Proprietary licensed data sets - Internally labelled data - Customer data per service agreements - Survey responses - Expert linguistic annotations

4.2.9 - Dataset purposes

The data used for training Textio’s ML systems is hand-labelled and validated by humans. Textio double-annotate data and only accept judgments that have been validated by several points of view before using it to train Textio’s models. This gold standard data is split and used for training, validation, and testing before being released to the production model.

Tier 2 - Data Specification

4.3.1 - Source data name

Textio

4.3.2 - Data modality

Text

4.3.3 - Data description

Job postings, emails, reports, marketing content and any other form of written communication.

4.3.4 - Data quantities

N/A

4.3.5 - Sensitive attributes

Textio captures user/employee information such as name, email and recruiting managers job title.

4.3.6 - Data completeness and representativeness

To ensure high quality text data for Textio language models, their processes include normalising text, tokenising the text, removing duplicates and irrelevant content, handling missing data, correcting linguistic errors and adding annotation for specific tasks. In addition they conduct manual reviews, these steps help to ensure accurate and reliable datasets.

4.3.7 - Source data URL

N/A

4.3.8 - Data collection

  • All data is stored with Amazon - AWS data and is stored in the US.
  • In regards to the use of meta data, this option has not been activated for MOD as we do not have SSO (Single Sign On). This means information on IP addresses is not being captured.
  • MOD is not a ‘data exchange partner’ - a service highlighted in the contract whereby the organisation shares information from their ATS (Applicant Tracking Software) to measure the performance of Textio job adverts.

4.3.9 - Data cleaning

To ensure high quality text data for Textio language models, their processes include normalising text, tokenising the text, removing duplicates and irrelevant content, handling missing data, correcting linguistic errors and adding annotation for specific tasks. In addition they conduct manual reviews, these steps help to ensure accurate and reliable datasets.

4.3.10 - Data sharing agreements

All data is stored with Amazon - AWS data is stored in the US.

4.3.11 - Data access and storage

Data retention and deletion of personal data: Textio provides a mechanism that allow staff to request the deletion of data. The user can contact [email protected], this is also referenced in the Textio privacy policy. Textio employs several security measures to safeguard user data, including the encryption for secure transmission and storage, strict access controls limiting system and database access to authorised personnel. Textio conduct regular security audits to identify and address vulnerabilities and also it has established incident response protocol for swift detection and mitigation of security incidents. Textio ensures that it complies with data protection regulation (GDPR) and DEFCON 532b.

Tier 2 - Risks, Mitigations and Impact Assessments

5.1 - Impact assessment

Textio employs several security measures to safeguard user data, including the encryption for secure transmission and storage, strict access controls limiting system and database access to authorised personnel. Textio conduct regular security audits to identify and address vulnerabilities and also it has established incident response protocol for swift detection and mitigation of security incidents. Textio ensures that it complies with data protection regulation (GDPR) and DEFCON 532b.

5.2 - Risks and mitigations

Due to MOD employee personal data being stored in overseas territory (MOD staff names, role and email), a data breach may have concerning consequences, i.e. identification of defence personnel. Due to the minimal storage of sensitive data and robust safeguards put in place by the supplier, this was deemed a low level risk according to MOD’s Secure By Design process. The suppliers service agreement, risk assessment and treatment process incident response process and information security policy have been reviewed and approved by subject matter experts within MOD.

Updates to this page

Published 17 December 2024