How does it work?

Data Audit

To gain value from general LLMs they need to be augmented with custom proprietary data sources. This requires knowing what data the organisation has, where it is stored and how to access and surface it to the LLM. In addition data compliance categorisation, anonymisation (e.g. vectorisation) and PII protection concerns need to be highlighted and mitigated.

‍

Data Liberation

Data sources need to be aggregated and orchestrated at scale. Serverless Data Lakes and ETL process can be leveraged to extract data from heterogeneous locations and formats, enabling access to LLM training. The scale of data to generate completely new LLMs is typically out of the range of any one company's data set, yet customisation of an existing LLM requires a much smaller data volume. Existing generic LLM models (e.g. Amazon Titan) can act as Foundational models and adapted with custom data.

Model Hosting and Observability

Once the foundational LLM has been adapted with custom data an API interface is built around the inference engine to provide access to the model to applications and services. This API can be built for a range of internal products as well as productised to third parties. In the fast moving world of regulation around LLMs strong observability and safeguards need to be built into this interface for audit and prevention of data compliance issues.

Book Your Free Consultation Session

No one likes to be the last to the party. If you’re looking to get into serverless, we’ve got experts waiting to help.

Fill in your details below, or simply give us a call on +442045716968

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

Large Language Model Customization

Large Language Model Customization

Large Language Model Customization

Custom LLMs

How does it work?

Relevant Case Studies

Book Your Free Consultation Session