Named Entity Recognition
The Named Entity Recognition model automatically identifies and categorizes key information in the audio.
This feature is in Alpha state.
Breaking changes may still be introduced to this API, but advance notice will be sent.
We’re looking for feedback to improve this feature; share yours here.
Named Entity Recognition is also known as Entity Detection.
Usage
Enable named entity recognition by setting named_entity_recognition to true in your transcription request.
request data
{
  "audio_url": "<your audio URL>",
  "named_entity_recognition": true
}
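As a minimal sketch, the request body above could be built and serialized in Python as follows. The helper name `build_ner_request` and the example audio URL are hypothetical; the endpoint and authentication headers are not covered on this page, so sending the request is left out.

```python
import json


def build_ner_request(audio_url: str) -> str:
    """Serialize a transcription request body with named entity recognition enabled.

    `build_ner_request` is a hypothetical helper, not part of any SDK.
    """
    payload = {
        "audio_url": audio_url,
        # Python's True is serialized as the JSON literal true.
        "named_entity_recognition": True,
    }
    return json.dumps(payload)


# Placeholder URL, for illustration only.
body = build_ner_request("https://example.com/audio.wav")
print(body)
```

Building the body from a dictionary and serializing it with `json.dumps` avoids hand-written JSON mistakes such as a missing comma between fields.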
Result
The transcription result will contain a "named_entity_recognition" key with the output of the model:
result
{
  "transcription":{...},
  "named_entity_recognition": {
    "success": true,
    "is_empty": false,
    "results": [
      {
        "entity_type": "EMAIL_ADDRESS",
        "text": "privacy@gladia.io",
        "start": 123.4,
        "end": 124.5
      },
      {
        "entity_type": "AGE",
        "text": "27 years old",
        "start": 234.7,
        "end": 235.6
      }
    ],
    "exec_time": 1.7726809978485107,
    "error": null
  }
}
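The result shown above could be read in Python roughly as follows. This is a sketch under assumptions inferred only from the field names: that "success" being false signals a failed run described by "error", that "is_empty" means no entities were found, and that "start" and "end" are offsets in seconds. The function name `extract_entities` is hypothetical.

```python
from typing import Any


def extract_entities(result: dict[str, Any]) -> list[dict[str, Any]]:
    """Return the detected entities from a transcription result.

    Assumes (not confirmed by this page) that success=False means the model
    failed and that is_empty=True means no entities were detected.
    """
    ner = result.get("named_entity_recognition")
    if ner is None:
        # The feature was not enabled in the request.
        return []
    if not ner.get("success", False):
        raise RuntimeError(f"Named entity recognition failed: {ner.get('error')}")
    if ner.get("is_empty", False):
        return []
    return ner.get("results", [])


# Sample data copied from the result above (transcription omitted).
sample = {
    "named_entity_recognition": {
        "success": True,
        "is_empty": False,
        "results": [
            {"entity_type": "EMAIL_ADDRESS", "text": "privacy@gladia.io", "start": 123.4, "end": 124.5},
            {"entity_type": "AGE", "text": "27 years old", "start": 234.7, "end": 235.6},
        ],
        "exec_time": 1.7726809978485107,
        "error": None,
    }
}

for entity in extract_entities(sample):
    # start/end are assumed to be seconds from the beginning of the audio.
    print(f"{entity['start']:.1f}s-{entity['end']:.1f}s {entity['entity_type']}: {entity['text']}")
```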
Supported Regulations
Gladia.io helps you cover the following regulations for your business:
- EU General Data Protection Regulation (GDPR)
- California Privacy Rights Act (CPRA)
- USA Health Insurance Portability and Accountability Act (HIPAA Safe Harbor)
- Quebec Privacy Act (Law 25)
- Japan Act on the Protection of Personal Information (APPI)
Supported Entities
Entity Type | Description | Example | Information Type | Regulatory Compliance | Note |
---|---|---|---|---|---|
ACCOUNT_NUMBER | Customer account or membership identification number | Policy No. 10042992; Member ID: HZ-5235-001 | PII | | Full support for English; multilingual support in progress |
AGE | Numbers associated with an individual’s age | 27 years old; 18 months old | PII | GDPR, HIPAA, Quebec Privacy Act, APPI | When given in years, only the number is flagged; when given in other units such as months or weeks, both the number and the time unit are flagged. Also includes age ranges: 29-35 years old; 18+; A man in his forties |
DATE | Specific calendar dates, which can include days of the week, dates, months, or years. See also: DATE_INTERVAL, DOB | Friday, Dec. 18, 2002; Dated: 02/03/97 | PII | HIPAA, Quebec Privacy Act | If no calendar date is specified, days of the week are not flagged: Your appointment is on Monday. Indexical terms are not flagged: yesterday; tomorrow |
DATE_INTERVAL | Broader time periods, including date ranges, months, seasons, years, and decades. See also: DATE, DOB | 2020-2021; 5-9 May; January 1984 | PII | HIPAA | |
DOB | Dates of birth. See also: DATE, DATE_INTERVAL | Born: March 7, 1961 | PII | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI | |
DRIVER_LICENSE | Driver’s permit numbers. See also: VEHICLE_ID | DL# 134711-320 | PII | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI | Includes International Driving Permits (IDP) and pilot’s licenses |
DURATION | Periods of time, specified as a number and a unit of time | 8 months; 2 years | PII | | Full support for English; multilingual support in progress |
EMAIL_ADDRESS | Email addresses | | PII | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI | |
EVENT | Names of events or holidays | Olympics; Yom Kippur | PII | ||
FILENAME | Names of computer files, including the extension or filepath | Taxes/2012/brad-tax-returns.pdf | PII | ||
GENDER_SEXUALITY | Terms indicating gender identity or sexual orientation, including slang terms | female; bisexual; trans | PII | CPRA, GDPR, GDPR Sensitive, APPI Sensitive | |
HEALTHCARE_NUMBER | Healthcare numbers and health plan beneficiary numbers | Policy No.: 5584-486-674-YM | PII | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI | Includes medical record numbers, health insurance policy/account numbers, and member IDs, for example, German Sozialversicherungsnummer (also used as SSN), Philippine PhilHealth ID number, Ukrainian VHI number |
IP_ADDRESS | IP addresses, including IPv4 and IPv6 formats | 192.168.0.1; 2001:db8:0:0:0:8a2e::7334 | PII | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI | |
LANGUAGE | Names of natural languages | Korean; French | PII | GDPR, GDPR Sensitive, APPI Sensitive | |
LOCATION | Metaclass for any named location reference; see subclasses below | Eritrea; Lake Victoria | PII | GDPR, HIPAA, APPI | May co-occur with ORGANIZATION when the context refers explicitly to the organization’s location: The patient was transferred to Northwest General Hospital |
LOCATION_ADDRESS | Full or partial physical mailing addresses, which can include: building name or number, street, city, county, state, country, zip code | 25/300 Adelaide T., Perth WA 6000, Aus.; 145 Windsor St.; Mail to: Kollwitzstr 13, 10405, Berlin | PII | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI | |
LOCATION_CITY | Municipality names, including villages, towns, and cities | Toronto; Berlin; Denpasar | PII | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI | |
LOCATION_COORDINATE | Geographic positions referred to using latitude, longitude, and/or elevation coordinates | We’re at 40.748440 and -73.984559 | PII | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI | |
LOCATION_COUNTRY | Country names | Canada; Namibia | PII | GDPR, APPI | |
LOCATION_STATE | State, province, territory, or prefecture names | Ontario; Arkansas; Ich lebe in NRW | PII | GDPR, APPI | |
LOCATION_ZIP | Zip codes (including Zip+4), postcodes, or postal codes | 90210; B2N 3E3 | PII | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI | Optimized for various English-speaking locales (Australia, Canada, United Kingdom, United States), as well as international equivalents |
MARITAL_STATUS | Terms indicating marital status | single; common-law; ex-wife; married | PII | APPI Sensitive | |
MONEY | Names and/or amounts of currency | 15 pesos; $94.50 | PII | ||
NAME | Names of individuals, not including personal titles such as ‘Mrs.’ or ‘Mr.’ | Dwayne Johnson; Mr. Khanna | PII | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI | |
NAME_FAMILY | Names indicating a person’s family or community; often a last name in Western cultures and first name in Eastern cultures | François Truffaut; Ozu Yasujirō | PII | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI | |
NAME_GIVEN | Names given to an individual, usually at birth; often first / middle names in Western cultures and middle / last names in Eastern cultures | François Truffaut; Ozu Yasujirō | PII | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI | |
NAME_MEDICAL_PROFESSIONAL | Full names, including professional titles and certifications, of medical professionals, such as doctors and nurses | Attending physician: Dr. Kay Martinez, MD | PII | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI | |
NUMERICAL_PII | Numerical PII (including alphanumeric strings) that doesn’t fall under other categories. See also a section below on international variants as some of them are mapped to this category, for example, Belgian BTW nummer or European VAT number. | | PII | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI | Includes the following: numbers in the medical field, such as device serial numbers, POS codes, NPI numbers, etc.; computer numbers like MAC addresses, cookie IDs, VPNs, error codes, access codes, message IDs, etc.; business-related numbers like DUNS numbers, company registration numbers, provider IDs, etc.; numbers related to purchasing, like order IDs, transaction numbers, confirmation numbers, tracking numbers, etc.; also numbers assigned to various forms of IDs, files, documents, proceedings, invoices, claim IDs, record IDs, etc. |
OCCUPATION | Job titles or professions | professor; actors; engineer; CPA | PII | Quebec Privacy Act, APPI | |
ORGANIZATION | Names of organizations or departments within an organization | BHP; McDonald’s; LAPD | PII | Quebec Privacy Act, APPI | May co-occur with LOCATION when the context refers explicitly to the organization’s location: Donations can be brought to Royal Canadian Legion Branch 43 |
ORGANIZATION_MEDICAL_FACILITY | Names of medical facilities, such as hospitals, clinics, pharmacies, etc. | Northwest General Hospital; Union Family Health Clinic | PII | Quebec Privacy Act, APPI | |
ORIGIN | Terms indicating nationality, ethnicity, or provenance | Canadian; Sri Lankan | PII | CPRA, GDPR, GDPR Sensitive, Quebec Privacy Act, APPI Sensitive | |
PASSPORT_NUMBER | Passport numbers, issued by any country | PA4568332; NU3C6L86S12 | PII | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI | |
PASSWORD | Account passwords, PINs, access keys, or verification answers | 27%alfalfa; temp1234; My mother’s maiden name is Smith | PII | CPRA, APPI | |
PHONE_NUMBER | Telephone or fax numbers | +4917643476050 | PII | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI | |
PHYSICAL_ATTRIBUTE | Distinctive bodily attributes, including terms indicating race | I’m 190cm tall; He belongs to the Black students’ association | PII | CPRA, GDPR, GDPR Sensitive, APPI Sensitive | |
POLITICAL_AFFILIATION | Terms referring to a political party, movement, or ideology | liberal; Republican | PII | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI | |
RELIGION | Terms indicating religious affiliation | Hindu; Presbyterian | PII | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI | |
SSN | Social Security Numbers or international equivalent government identification numbers | 078-05-1120; ***-***-3256 | PII | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI | Includes, for example, Australian TFN, Belgian NISS, British NIN, Canadian SIN, Dutch BSN, German Sozialversicherungsnummer (also used as a healthcare number, see: HEALTHCARE_NUMBER), French INSEE, Indian Aadhaar, Italian TIN, Philippine SSS, Spanish NUSS, Ukrainian TIN, and Mexican NSS formats. Flags mentions of complete numbers as well as the last four digits only. |
TIME | Expressions indicating clock times | 19:37:28; 10pm EST | PII | ||
URL | Internet addresses | www.private-ai.com | PII | CPRA, GDPR, HIPAA, Quebec Privacy Act | |
USERNAME | Usernames, login names, or handles | privateairocks; @_PrivateAI | PII | CPRA, GDPR, APPI | |
VEHICLE_ID | Vehicle identification numbers (VINs), vehicle serial numbers, and license plate numbers. See also: DRIVER_LICENSE | 5FNRL38918B111818; BIF7547 | PII | CPRA, GDPR, HIPAA, APPI | |
ZODIAC_SIGN | Names of Zodiac signs | Aries; Taurus | PII | ||
BLOOD_TYPE | Blood types | She’s type AB positive | PHI | CPRA, GDPR, HIPAA, Quebec Privacy Act | |
CONDITION | Names of medical conditions, diseases, syndromes, deficits, disorders | chronic fatigue syndrome; arrhythmia; depression | PHI | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI Sensitive | |
DOSE | Medically prescribed quantity of a medication | limit intake to 700 mg/day | PHI | ||
DRUG | Medications, vitamins, and supplements | advil; Acetaminophen; Panadol | PHI | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI Sensitive | |
INJURY | Bodily injuries, including mutations, miscarriages, and dislocations | I broke my arm; I have a sprained wrist | PHI | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI Sensitive | |
MEDICAL_PROCESS | Medical processes, including treatments, procedures, and tests | heart surgery; CT scan | PHI | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI Sensitive | |
STATISTICS | Medical statistics | 18% of patients | PHI | HIPAA, Quebec Privacy Act | |
BANK_ACCOUNT | Bank account numbers and international equivalents, such as IBAN | Acct. No.: 012345-67 | | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI | |
CREDIT_CARD | Credit card numbers | 0123 0123 0123 0123; **** **** ****4252 | | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI | Includes debit, ATM, Direct Debit, PrePay, Charge Cards, and support for cards that do not have 16 digits such as American Express or China UnionPay cards. Flags mentions of complete numbers as well as the last four digits only. |
CREDIT_CARD_EXPIRATION | Expiration date of a credit card | Expires: July 2023; Exp: 02/28 | | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI | |
CVV | 3- or 4-digit card verification codes and equivalents | CVV: 080 | | CPRA, GDPR, HIPAA, Quebec Privacy Act, APPI | Includes institution-specific variants: American Express: CID (card ID), CVD (card verification data); CSC / 3CSC (card security code); China UnionPay: CVN (card validation number); CIBC Mastercard: SPC (signature panel code); Discover: CID (card ID), CVD (card verification data); ELO (Brazil): CVE (Elo verification code); JCB (Japan Credit Bureau): CAV (card authentication value); Mastercard: CVC (card validation code); VISA: CVV (card verification value) |
ROUTING_NUMBER | Routing number associated with a bank or financial institution | 012345678 | | | Includes international equivalents: Canadian & British sort codes, Australian BSB numbers, Indian Financial System Codes, Branch/transit numbers, Institution numbers, and Swift codes |
CORPORATE_ACTION | Any action a company takes that could affect its stock value or its shareholders | Bridge Investment Group LLC (later renamed Bridge Investment Group Holdings LLC); We’ve merged two neighboring retail locations | | | Only available with our English models. |
FINANCIAL_METRIC | Quantitative indicators of a company’s financial health, such as financial metrics or ratios | adjusted earnings per share declined year-over-year; Online sales slow as UK shoppers rein in Christmas spending | | | Only available with our English models. |
MEDICAL_CODE | Codes belonging to medical classification systems such as SNOMED, ICD-10, NDC, etc. | 1981-03-11T04:11:32-03:00 Forearm sprain SNOMED-CT 70704007; | PHI | CPRA, GDPR, GDPR Sensitive, HIPAA, Quebec Privacy Act, APPI Sensitive | Only available with our English models. |
PRODUCT | Names or model numbers of items made by an organization; includes intangible products like software and games, as well as other services | iPhone; Toyota Camry | | | Only available with our English models. |
TREND | A description of the “quality” or the direction in which a financial measurement is going | reflecting the accelerating shift of off-line to online; amid rising costs and shrinking profits | | | Only available with our English models. |
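One way to use the table above is to keep only the detected entities that fall under a given regulation. The sketch below hard-codes a small excerpt of the table; the mapping name `REGULATIONS_BY_ENTITY`, the function `entities_for_regulation`, and the MONEY sample entity with its timestamps are hypothetical illustrations, not part of the API.

```python
# Excerpt of the Supported Entities table: entity type -> regulations listed for it.
# Extend this mapping with the rows relevant to your use case.
REGULATIONS_BY_ENTITY = {
    "AGE": {"GDPR", "HIPAA", "Quebec Privacy Act", "APPI"},
    "EMAIL_ADDRESS": {"CPRA", "GDPR", "HIPAA", "Quebec Privacy Act", "APPI"},
    "LOCATION_COUNTRY": {"GDPR", "APPI"},
    "DATE_INTERVAL": {"HIPAA"},
    "MONEY": set(),  # no regulation is listed for this entity type
}


def entities_for_regulation(entities: list[dict], regulation: str) -> list[dict]:
    """Keep only the entities whose type is listed under the given regulation."""
    return [
        entity
        for entity in entities
        # Entity types missing from the excerpt are treated as not covered.
        if regulation in REGULATIONS_BY_ENTITY.get(entity["entity_type"], set())
    ]


# Entities in the shape of the "results" array; the MONEY entry is made up.
detected = [
    {"entity_type": "EMAIL_ADDRESS", "text": "privacy@gladia.io", "start": 123.4, "end": 124.5},
    {"entity_type": "AGE", "text": "27 years old", "start": 234.7, "end": 235.6},
    {"entity_type": "MONEY", "text": "$94.50", "start": 300.0, "end": 301.2},
]

cpra_entities = entities_for_regulation(detected, "CPRA")
print([entity["entity_type"] for entity in cpra_entities])
```

Treating unknown entity types as not covered is a design choice for this sketch; a stricter pipeline might instead raise an error so that new entity types are reviewed explicitly.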