Embulk parser plugin for XML
Parser plugin for Embulk.
Reads the input as XML and emits each matching entry as an output record.
parser:
  type: xml
  root: data/students/student
  schema:
    - {name: name, type: string}
    - {name: age, type: long}
If you need to parse a column as a timestamp, a schema entry supports two optional parameters, format and timezone:
schema:
  - {name: timestamp_column, type: timestamp, format: "%Y-%m-%d", timezone: "+0000"}
If timezone is omitted, "+0900" is used by default.
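
To make the effect of format and timezone concrete, here is a minimal Python sketch of the same idea: parse a date string with a strftime-style pattern, then attach a fixed UTC offset. It is only an illustration, not the plugin's own code; the function name parse_timestamp is hypothetical, and the plugin's handling of format directives may differ from Python's.

```python
from datetime import datetime


def parse_timestamp(text, fmt="%Y-%m-%d", tz="+0900"):
    """Parse `text` with `fmt`, then attach the fixed offset given by `tz`.

    Hypothetical helper: the "+0900" default mirrors the default stated
    in this README, but the rest is an assumption for illustration.
    """
    # Turn an offset string such as "+0900" into a tzinfo object.
    tzinfo = datetime.strptime(tz, "%z").tzinfo
    # Parse the naive value, then label it with the configured offset.
    return datetime.strptime(text, fmt).replace(tzinfo=tzinfo)


utc_value = parse_timestamp("2015-07-01", tz="+0000")
default_value = parse_timestamp("2015-07-01")

print(utc_value.isoformat())      # 2015-07-01T00:00:00+00:00
print(default_value.isoformat())  # 2015-07-01T00:00:00+09:00
```

The same text yields two different instants depending on the offset, which is why setting timezone explicitly matters when the source data is not in +0900.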
The xpath type takes XPath-style expressions for root and for each column's path:
parser:
  type: xpath
  root: //data/students/student
  schema:
    - {path: name, type: string, name: name}
    - {path: age, type: long, name: age}
    - {path: hobbies/hobby, type: json, name: hobbies}
Schema entries for the xpath type accept the same two optional timestamp parameters, format and timezone:
schema:
  - {name: timestamp_column, type: timestamp, format: "%Y-%m-%d", timezone: "+0000"}
If timezone is omitted, "+0900" is used by default.
Here is an example XML document:
<data>
  <result>true</result>
  <students>
    <student>
      <name>John</name>
      <age>10</age>
      <hobbies>
        <hobby>music</hobby>
        <hobby>movie</hobby>
      </hobbies>
    </student>
    <student>
      <name>Paul</name>
      <age>16</age>
      <hobbies>
        <hobby>game</hobby>
      </hobbies>
    </student>
    <student>
      <name>George</name>
      <age>17</age>
    </student>
    <student>
      <name>Ringo</name>
      <age>18</age>
    </student>
  </students>
</data>
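
As a rough illustration of how the root and schema settings relate to this document, the following Python sketch walks each student element and builds one record per entry. It is not the plugin's implementation. The function name extract_students is hypothetical, and representing hobbies as a list (empty when a student has no hobbies element) is an assumption about how a json column might look.

```python
import xml.etree.ElementTree as ET

XML = """<data>
  <result>true</result>
  <students>
    <student>
      <name>John</name>
      <age>10</age>
      <hobbies>
        <hobby>music</hobby>
        <hobby>movie</hobby>
      </hobbies>
    </student>
    <student>
      <name>Paul</name>
      <age>16</age>
      <hobbies>
        <hobby>game</hobby>
      </hobbies>
    </student>
    <student>
      <name>George</name>
      <age>17</age>
    </student>
    <student>
      <name>Ringo</name>
      <age>18</age>
    </student>
  </students>
</data>"""


def extract_students(xml_text):
    """Return one dict per <student>, mimicking root + schema (assumed behavior)."""
    data = ET.fromstring(xml_text)  # the parsed element is <data> itself
    rows = []
    # "students/student" is relative to <data>, so it selects the same
    # elements as the root setting data/students/student.
    for student in data.findall("students/student"):
        rows.append({
            "name": student.findtext("name"),      # type: string
            "age": int(student.findtext("age")),   # type: long
            # Assumption: collect every hobby; no <hobbies> gives [].
            "hobbies": [h.text for h in student.findall("hobbies/hobby")],
        })
    return rows


for row in extract_students(XML):
    print(row)
```

For the sample above this prints four records. The first is {'name': 'John', 'age': 10, 'hobbies': ['music', 'movie']}, and George and Ringo get an empty hobbies list under the stated assumption.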