ชุดข้อมูลคำอธิบายคลังข้อมูลเสียงพูด (Speech Corpus Annotations)

มาตรฐานนี้ดัดแปลงมาจากรูปแบบไฟล์ของชุดข้อมูล Mozilla Common Voice

รูปแบบไฟล์#

{
  "records": [
    {
      "path": <ชื่อไฟล์เสียง: file_reference>,
      "sentence": <ประโยคที่พูด: string>,

      "age": <optional - อายุหรือช่วงอายุของผู้พูด: number or string>,
      "gender": <optional - เพศของผู้พูด: string>,
      "accent": <optional - สำเนียงที่พูด: string>,
      "locale": <optional - ภาษาที่พูด: string - ISO 3166 country code, 
                 lower-case>,

      ... <เพิ่ม key อื่นๆ ที่เกี่ยวข้องกับไฟล์นี้ได้ตามต้องการ>
    },
    ... // หมายเหตุ: 1 javascript object ต่อ 1 ไฟล์
  ]
}

Keys#

Key	Required	Type	Description
`path`	✅	`file_reference` or `url`	Path or URL to the sound file containing the speech
`sentence`	✅	`string`	The sentence that was spoken
`age`		`string`	Age range of the speaker (e.g. `"forties"`)
`gender`		`string`	Gender of the speaker (e.g. `"M"`, `"F"`, `"Other"`)
`accent`		`string`	Accent or dialect used in the utterance
`locale`		`string` (ISO 3166 country code)	Locale of the language (e.g. `"TH"`)

ตัวอย่าง#

{
  "records": [
    {
      "path": "common_voice_th_123456.mp3",
      "sentence": "ดำและแดงไปโรงเรียน",
      "locale": "th",
      "age": "twenties",
      "gender": "male"
    },
    {
      "path": "common_voice_th_123457.mp3",
      "sentence": "ระยะเวลาของโครงการคือ?",
      "locale": "th",
      "age": "fourties",
      "gender": "female"
    }
  ]
}