Back to Insights
Governance

How Professional Services Firms Can Use Generative AI Safely: From Name Masking to Data Classification and Client-Matter Isolation

Ascend Gravity ResearchJune 1, 202614 min read

A masked name does not mean a client is no longer identifiable. For Hong Kong professional services firms, generative AI confidentiality governance must bring data classification, residual context, tool boundaries and client-matter isolation into everyday workflows.

Key Takeaways

  • Lower-risk use cases usually include public-information research, generic templates, internal preparation without client context and non-client-specific language polishing.
  • KYC, client originals, transaction strategy, family structures, litigation background and context that can reconstruct a client matter should be excluded from external models by default.
  • Management should govern more than a name-masking rule: data classification, tool boundaries, output review, audit records and client-matter isolation all need to be designed into the workflow.

For many professional services firms, generative AI may already be appearing in summarisation, drafting, translation, document retrieval and internal analysis. Even where usage remains at the level of individual staff experimentation, client-confidentiality questions have already moved into new contact points: prompts, file uploads, knowledge-base retrieval, vendor logs and model outputs.

The most common first step is to mask names, document numbers, phone numbers and emails. This article uses name masking as shorthand for that action. It is worth standardising because it removes the most visible identifiers. In professional services data, however, client identity and matter background often sit in combinations of dates, jurisdictions, assets, relationships and transaction chronology.

In professional services, client confidentiality rarely depends on names alone. Corporate services teams handle director changes, beneficial ownership and shareholding records. Family offices handle relationships, asset allocation and cross-border planning. Finance, real estate, legal and advisory teams handle KYC records, transaction chronology, property details, dispute background and internal analysis. Even after direct identifiers are removed, these facts can still form a pattern that points back to a client or matter.

For professional services firms, generative AI confidentiality governance also covers confidential information beyond personal data. Even information that may not qualify as personal data can still be a client trade secret, transaction strategy, family arrangement, litigation strategy or confidential fact inside a professional engagement.

Where AI misuse most often appears by service line

Corporate services and company secretarial
Director changes, beneficial ownership, shareholding percentages, group restructuring timelines and unusual transaction background can be pasted into summarisation tools as ordinary context. Beyond name masking, dates, percentages, role relationships and transaction reasons may also need to be generalised.
Finance, asset management and trusts
KYC, source of funds, investment intent, asset allocation, trust structures and client risk appetite can reveal the client profile more clearly than a name. Client profiles and asset structures should not enter general-purpose models; internal retrieval should also be isolated by client and matter.
Legal and disputes
Litigation chronology, counterparty descriptions, evidence summaries, draft advice and internal strategy may involve both client confidentiality and legal privilege. The assessment cannot stop at whether personal data has been removed.
Real estate transactions and family offices
Property location, transaction amount, family relationships, cross-border arrangements and timelines need to be handled together. A matter may still be reconstructed through location, amount, role relationships and chronology even after names are masked.

This article narrows the issue to generative AI workflows: how firms can start with masking names, then continue assessing residual context, model behaviour and the toolchain, before placing control points back into everyday operations. The core question is: after direct identifiers are removed, can the remaining context still allow a model, vendor, internal user or attacker to infer who the client is, which client matter is being discussed, and which sensitive facts are likely true?

For professional services firms, confidentiality governance needs to work in three layers: define whether the remaining data is still identifiable, measure re-identification and inference leakage risk, then place control points back into everyday workflows.

Professional services firms do not need to treat every generative AI use case as equally risky. Management can begin by drawing three boundaries so teams know what can be opened, what requires caution and what should not be sent externally by default.

Generative AI usage boundaries

Recommended levelTypical scenariosManagement judgement
OpenPublic-information research, generic templates, internal preparation without client context, non-client-specific language polishing.Use approved tools, keep basic usage records and apply output-review expectations.
Open with cautionDepartment workflow summaries, de-identified training material, low-risk internal drafts.Mask names first, then generalise dates, locations, amounts and roles; check whether the output reintroduces sensitive clues.
Do not send externally by defaultKYC, client originals, transaction strategy, family structures, litigation background, client-matter files.Use private environments, isolated indexes, human summaries or specific approval workflows.
High-risk programmeCross-client knowledge-base retrieval, agents that automatically read email, documents, spreadsheets or databases.Require permission isolation, logging, red-team testing and human supervision; do not rely on staff judgement alone.

Start with Names

Many internal AI pilots begin by masking direct identifiers: detecting and deleting, replacing or hiding names, ID numbers, passport numbers, phone numbers, emails and addresses. That step is useful and should be standardised. It addresses the most visible layer.

Four terms that are easy to blur

Masking
Deleting, replacing or hiding direct identifiers such as names, ID numbers, addresses, phone numbers or emails.
Pseudonymisation
Replacing identity with a code or substitute, reducing direct linkage while leaving a path back through additional information.
De-identification
Reducing the ability of data to point to a specific person, client or matter, including the residual clues left behind.
Anonymisation
The highest bar: re-identification is not reasonably possible under the relevant conditions.

In professional services, the identifying force often sits in combined signals:

  • A jurisdiction, a director appointment date, a shareholding percentage and a corporate structure.
  • A property, a lease chronology, a transaction amount and a financing arrangement.
  • A family relationship, an immigration plan, a tax issue and an asset class.
  • A litigation timeline, a court procedure, a counterparty description and an internal strategy memo.

Each data point can look harmless in isolation. Together, they may be more sensitive than a name. A team may treat the material as background context, while in a specific setting it can still form a reconstructable client-matter profile.

The same task: weaker and safer prompts

TaskAvoidSafer
Meeting-note summarySummarise these meeting notes: the client is a second-generation Hong Kong family dealing with a Singapore trust, a London residential property restructuring and a holding-company director change in Q3.Using the generalised content below, prepare an internal action list for a client involving cross-border asset planning, trust-structure adjustment and company-governance matters. Do not retain specific jurisdictions, dates, asset locations, family relationships or identifiable transaction chronology.
Document draftingUsing the following KYC, asset allocation and family-member background, draft the next-step advice for the client.Using an abstract scenario that excludes client identity, asset scale and family relationships, provide a general document structure and checklist. The final content will be completed by a professional reviewer.
A redacted document surrounded by contextual clues that form a fingerprint pattern
Even after names are masked, dates, locations, assets, relationships and transaction context can combine into an identifiable client-matter profile.

How Residual Context Can Reconstruct a Client Matter

Name masking handles the most visible layer. The next questions are whether the remaining data can still single out a person, client or matter; whether it can be joined with other documents to form a fuller profile; and whether a model can infer sensitive facts that were not directly stated.

Regulatory and technical frameworks often split these residual risks into three categories. The former EU Article 29 Working Party framed them as singling out, linkability and inference in its opinion on anonymisation techniques. It also made clear that pseudonymisation reduces linkability to the original identity while leaving residual identification risk.

The European Data Protection Board (EDPB) takes the same direction in its 2025 pseudonymisation guidelines, adopted for public consultation: data that can still be attributed to a natural person using additional information may remain personal data, and the assessment can include external information that is reasonably expected to be available.

The US National Institute of Standards and Technology (NIST) makes the operational point clearly in SP 800-188: de-identification usually requires specialised tools to manipulate data and estimate re-identification risk; tools that only mask personal information remain at the preprocessing layer and do not complete a full de-identification assessment.

These EU and US frameworks may not directly create legal obligations for Hong Kong institutions, but they provide useful governance language and technical standards.

In a generative AI workflow, field-level masking answers one question: "Are obvious identifiers still present?" It leaves the harder client-confidentiality questions open:

Questions that remain after masking

Is the remaining combination rare?
Rare combinations can single out a client even without a name.
Can records be linked across documents?
Separate summaries may look safe, then merge into a complete matter trail.
Can the model amplify inference?
A model may turn fragments into a specific identity, financial profile or strategic intent.
Does the toolchain retain context?
Logs, vector stores, RAG indexes and agent tools can extend the life of sensitive context. RAG is a pattern where the model retrieves internal files or knowledge-base material before answering.

How Generative AI Amplifies Inference Risk

Traditional masking rules often assume that the attacker is looking for fixed fields. Generative AI models read relationships, implications, chronology and context from natural language. As a result, residual text that looks harmless can become inferentially useful.

An academic study published at ICLR 2024 found that large language models could infer personal attributes such as location, income and sex from real Reddit profiles, reaching up to 85% top-1 and 95.8% top-3 accuracy. The authors also found that common text anonymisation and model alignment offered limited defences against this type of inference.

In professional services, inference leakage often looks subtler than a model printing a client's name. It may look like:

  1. The model combining a family profile, asset class, jurisdiction and transaction timeline into a recognisable outline.
  2. A summary retaining amounts, dates, relationships or locations that should have been generalised.
  3. Multi-turn chats exposing context gradually, making later prompts more identifying than any single prompt.
  4. RAG or tool-calling agents pulling documents, emails and spreadsheets that were never meant to be joined in one context.

The UK National Cyber Security Centre (NCSC) points to a related design issue in its analysis of prompt injection: current large language models lack a natural security boundary between data and instructions inside a prompt. In a high-confidentiality workflow, leakage can occur across input preprocessing, retrieval, tool use and generated output.

Turn Confidentiality Risk into Observable Indicators

Management teams need comparable indicators. Collapsing confidentiality risk into one universal score can hide the differences between risk layers. It is more defensible to measure layers separately.

Three observable layers

  1. Direct identifiers

    Whether names, documents, phones, addresses, emails or client codes remain. This is the easiest layer to scan, and the layer most often treated as the complete governance answer.

  2. Quasi-identifiers

    Whether dates, locations, amounts, relationships, jurisdictions and client-matter facts form rare combinations. This layer is about relationships across fields.

  3. System behaviour

    How the model, RAG layer, tools, logs and output filters behave under misuse or attack. This layer determines whether residual signals can be amplified or re-linked.

For structured data, technical teams can use re-identification probability, record-matching probability, k-anonymity or differential privacy to assess risk. Management does not need to design every metric; it does need to understand what question each metric answers, and whether the evidence can be repeated, compared and audited.

For unstructured professional-services material, risk measurement can begin with proxy indicators. They will not produce one exact leakage probability, but they make confidentiality risk repeatable, comparable and reviewable.

Management questions and proxy indicators

Management questionDoes this prompt still contain clear client data?
Measurable proxy indicatorResidual PII count; PII detection recall.
Management questionCould the client still be guessed without a name?
Measurable proxy indicatorQuasi-identifier uniqueness; candidate-client reduction ratio.
Management questionCould multiple documents expose the matter when combined?
Measurable proxy indicatorCross-document link success rate; candidate-matter reduction ratio; matter reconstruction risk score.
Management questionCould the model complete clues into sensitive facts?
Measurable proxy indicatorAttribute inference success rate; red-team attack success rate.
Management questionCould the output re-disclose sensitive information?
Measurable proxy indicatorOutput PII hit rate; sensitive-entity reproduction rate.

NIST's 2025 differential privacy guidance makes the same point in technical terms: one epsilon value is not a full protection model; teams still need to define the protected unit, the query model, the threat model, side channels, system security and access controls. NIST's Generative AI Profile points in a similar direction: remove unnecessary PII before input, filter outputs for privacy risk, verify sources and citations, keep assessing safety controls, and periodically use AI red-teaming to test risks such as prompt injection, membership inference and model extraction.

Set Control Strength by Data Sensitivity

Controls by data sensitivity

Low-sensitivity workflows

Typical scenarios include public-information research, generic templates and internal preparation without client context.

  • Mask direct identifiers and retain general background.
  • Limit use to a single document or short context.
  • Use an approved tool list.
  • Run a basic PII scan.
Medium-sensitivity workflows

Typical scenarios include team workflows, de-identified summaries and low-risk internal drafts.

  • Generalise dates, locations, amounts and roles in addition to masking.
  • Limit retrieval by team or workflow.
  • Use a prompt gateway to check data type, purpose and vendor route.
  • Filter outputs and verify sources.
High-sensitivity workflows

Typical scenarios include KYC, client originals, transaction strategy, family structures, litigation background and client-matter files.

  • Do not send raw client-matter text by default; use private environments, isolated indexes or human summaries.
  • Separate context by client and matter, and prohibit cross-client data joining.
  • Block, de-sensitise or require human approval for high-sensitivity prompts.
  • Require human review before delivery, with audit records.

The goal is to make the risk rationale explicit. Low-sensitivity workflows can stay lighter. High-sensitivity workflows need institutional controls that give employees clear boundaries for moment-by-moment judgement.

From Regulatory Principles to Everyday Workflows

Hong Kong's PCPD published the Model Personal Data Protection Framework in 2024, covering AI strategy and governance, risk assessment and human oversight, system implementation and management, and stakeholder engagement.

It also refers to acceptable inputs, permitted and prohibited prompts, traceability, auditability, data security, red teaming, incident response and continuous monitoring.

For a professional services firm, that can be translated into a more concrete operating flow:

  1. Classify data: separate client data, client-matter data, public data, internal strategy and delivery documents.
  2. Classify use cases: distinguish research, summarisation, drafting, client delivery, decision support and automated execution.
  3. Control before input: use a prompt gateway to detect direct identifiers, quasi-identifiers, confidential clauses and cross-client-matter clues.
  4. Minimise context: provide only the documents, fields and chronology needed for the task.
  5. Set model and vendor boundaries: choose enterprise SaaS, private deployment, local models or no generative AI according to sensitivity.
  6. Scan after output: check whether the model reintroduced names, relationships, locations, amounts, strategy or unsupported sources.
  7. Review and log: keep a clearly responsible reviewer for high-sensitivity workflows and log prompts, versions, sources, reviewers and exceptions.
  8. Red-team and monitor: test prompt injection, unauthorised retrieval, exfiltration and inference reconstruction, then report results through a governance dashboard.

Common terms used here

Prompt gateway
A control layer that checks, blocks, de-sensitises or records prompts before employees send them to a model.
RAG
A pattern where the model retrieves internal files or knowledge-base material before answering, often used for enterprise document Q&A.
Tool-calling agent
A model workflow that can call external tools, search, databases or system actions.
Matter-level compartmentalisation
Separating context by client and matter to avoid mixing data across clients, permissions or projects.
Red-teaming
Testing the system with simulated attacks or misuse scenarios to see whether it leaks data, calls tools without authority or violates policy.
Epsilon
A parameter used in differential privacy to express privacy loss; the number alone does not represent the whole protection model.

Why Clients Will Care

This reaches beyond internal compliance. It affects client trust and business development. Large corporates, family clients, financial institutions and cross-border transaction clients are increasingly likely to ask in RFPs, onboarding reviews or information-security questionnaires whether client data enters third-party AI tools, whether vendors retain inputs, whether staff can use external models on their own, and whether outputs are reviewed by an accountable professional.

One improper upload may not immediately create a public breach, but it can still weaken confidence in the firm's confidentiality culture. For professional services firms, the stronger position is using AI to improve efficiency while preserving client confidentiality, professional judgement and accountability boundaries.

Management Self-Check: Five Questions

For professional services teams already piloting or deploying generative AI in client work, the following five questions can serve as a management self-check:

  1. Can the institution identify which prompts or file uploads have contained client, client-matter or transaction context?
  2. Has the institution assessed whether dates, locations, amounts, roles and structures remain identifiable after direct identifiers have been processed?
  3. Can the institution restrict RAG or agents from joining context across different clients, permissions or client matters?
  4. Has the institution tested data leakage risk under prompt injection, unauthorised retrieval and inference reconstruction scenarios?
  5. Has the institution defined which AI outputs may be used as internal references and which require qualified professional review before delivery?

If the answers are unclear, the workflow governance model is probably still unfinished. Model selection is only one part of the issue. Masking names reduces part of the risk; if the remaining context can reconstruct the client's fact pattern, residual confidentiality risk still needs to be identified, recorded and controlled.


Selected References