RAG - HR

Learn about fundamental concepts of Superlinked.

Many companies aim to make HR documents more accessible to employees and contractors. A recent promising approach is building a RAG system on top of HR documents. This is because it

  • provides a unique response to any query,

  • reduces hallucinations as the answer has to be grounded in the retrieved context,

  • makes the process highly scalable due to automatic question-answering.

However, this solution possesses its unique challenges. For example

  • keeping the knowledgebase consistent and up-to-date,

  • running LLMs efficiently,

  • ensuring the generated results are correct and aligned with company communication guidelines,

In our case, we have three HR policy sources for our hypothetical company.

  1. An older HR policy, which contains our maternity leave policy and details on manager responsibilities.

  2. A more recent update, that contains inaccurate information on management responsibilities - but one that also contains unique information on paternity leave.

  3. A newer policy document that contains updated information about management responsibilities, correcting the mistakes of the previous modernising attempt - alongside some other HR policy information.

A good system will be able to:

  • provide relevant information on maternity leave (only covered in the old document),

  • synthesize contradicting information and only present to us the correct ones

Regarding synthesizing information, there are contradictions between the documents pertaining to management responsibilities. We are going to use the

  • creation_date of the policy,

  • and the usefulness score of each paragraph

as a proxy to know for similar documents with

  • similar information but different wording, and some important parts slightly altered, or

  • also talking about seemingly the same topic, but without conveying useful information

which one is more relevant to our query.

Follow along in this Colab

Last updated