Open Vietnamese Legal Dataset
VietLex Open Vietnamese Legal Dataset v1.0 — 186,956 rows, 85 MB compressed. Free, CC BY 4.0. For researchers, AI/ML engineers, legal-tech builders worldwide.
Five datasets
| Dataset | Rows | Size | Description |
|---|---|---|---|
| vbpl.ndjson | 60,190 | 39.8 MB | Laws, decrees, circulars, decisions issued by Vietnamese government authorities from 1945 to present. |
| vbpl_effects.ndjson | 27,835 | 4.6 MB | Legal relationships: which document amends, repeals, supersedes, consolidates which. |
| theses.ndjson | 3,373 | 4.1 MB | MA/PhD theses harvested via OAI-PMH from 18 Vietnamese and international university repositories. |
| goi_thau.ndjson | 85,519 | 35.1 MB | Vietnamese public procurement bidding notices (gói thầu mời thầu). |
| ket_qua_lcnt.ndjson | 10,039 | 1.4 MB | Vietnamese public procurement award outcomes — winning contractor, price, approval date. |
Access methods
REST API (NDJSON stream)
GET /api/v1/dump
Stream metadata directly. No auth, fair-use rate limit.
OAI-PMH 2.0 endpoint
GET /oai-pmh?verb=ListRecords
Dublin Core metadata. Compatible with Google Scholar, BASE, CORE, OpenAIRE.
OpenAPI 3.1 + Swagger UI
/cho-ai/api/docs
Try every endpoint interactively. Generate clients in any language.
MCP Server (Anthropic standard)
npx @vietlex/mcp-server
One-line install for Claude Desktop, Cursor, Continue, Cline.
How to cite
@dataset{vietlex_open_2026,
author = {Hoàng, Quốc Hải and contributors},
title = {{VietLex Open Vietnamese Legal Dataset v1.0}},
year = 2026,
publisher = {Zenodo},
doi = {10.5281/zenodo.PENDING},
url = {https://vietlex.vn/du-lieu},
}Legal basis
Per Vietnamese Intellectual Property Law Article 15 §2 (2005, amended 2022), Vietnamese legal normative documents are NOT subject to copyright. The CC BY 4.0 license here applies to the COLLECTION (selection, structuring, metadata extraction, citator graph) and the ENRICHMENT (multilingual titles, tags, hash verification). Underlying laws are public domain.
Personal data scrubbed per Decree 13/2023/NĐ-CP (Vietnam Personal Data Protection).