Collected Data¶
Important
- This section describes collected datasets that are used to create data for the Vietnam Transport Risk Analysis (VTRA)
- The datasets listed here are specific to Vietnam and are used as inputs to data in the Processed Data Assembly steps
- To implement the VTRA pre-processing without changing the existing code, all data described here should be created and stored exactly as indicated below
Networks¶
- All pre-processed network data are stored:
  - In sub-folders in the file path - /data/pre_processed_networks_data/
  - As Shapefiles of network nodes and edges
  - The names of files and folders are self-explanatory
  - See /data/pre_processed_networks_data/networks_description.xlsx for details of all shapefiles
- All nodes should have the following attributes:
  - node_id - String node ID
  - geometry - Point geometry of node with projection EPSG:4326
  - variable list of attributes depending upon sector
- All edges should have the following attributes:
  - edge_id - String edge ID
  - g_id - Integer edge ID
  - from_node - String node ID that should be present in node_id column
  - to_node - String node ID that should be present in node_id column
  - geometry - LineString geometry of edge with projection EPSG:4326
  - variable list of attributes depending upon sector
Note
We assume that networks are provided as topologically correct connected graphs: each edge is a single LineString (it may be a straight line or a more complex line), but must have exactly two endpoints, which are labelled as from_node and to_node (the values of these attributes must correspond to the node_id of a node).
Wherever two edges meet, we assume that there is a shared node, matching each of the intersecting edge endpoints. For example, at a T-junction there will be three edges meeting at one node.
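The assumptions in the note above can be checked before running any pre-processing. The following is a minimal sketch, not part of the VTRA code: it uses plain Python dictionaries as stand-ins for the node and edge attribute tables read from the Shapefiles, and the function names check_network and node_degrees, the sample IDs and the coordinate tuples are all hypothetical.

```python
from collections import Counter

# Attribute names taken from the lists above.
REQUIRED_NODE_ATTRS = {"node_id", "geometry"}
REQUIRED_EDGE_ATTRS = {"edge_id", "g_id", "from_node", "to_node", "geometry"}


def check_network(nodes, edges):
    """Return a list of problems found in a node table and an edge table."""
    problems = []
    tables = (("node", nodes, REQUIRED_NODE_ATTRS), ("edge", edges, REQUIRED_EDGE_ATTRS))
    for kind, rows, required in tables:
        for i, row in enumerate(rows):
            missing = required - set(row)
            if missing:
                problems.append(f"{kind} row {i} is missing {sorted(missing)}")

    # Every edge endpoint must refer to an existing node_id.
    node_ids = {node["node_id"] for node in nodes if "node_id" in node}
    for edge in edges:
        for end in ("from_node", "to_node"):
            if end in edge and edge[end] not in node_ids:
                problems.append(
                    f"edge {edge.get('edge_id')}: {end}={edge[end]!r} is not a known node_id"
                )
    return problems


def node_degrees(edges):
    """Count how many edge endpoints meet at each node."""
    degree = Counter()
    for edge in edges:
        degree[edge["from_node"]] += 1
        degree[edge["to_node"]] += 1
    return degree


# Hypothetical T-junction: three edges meeting at the shared node "n0".
# Geometries are (lon, lat) tuples standing in for Point/LineString objects.
nodes = [
    {"node_id": "n0", "geometry": (105.0, 21.0)},
    {"node_id": "n1", "geometry": (104.9, 21.0)},
    {"node_id": "n2", "geometry": (105.1, 21.0)},
    {"node_id": "n3", "geometry": (105.0, 20.9)},
]
edges = [
    {"edge_id": "e0", "g_id": 0, "from_node": "n1", "to_node": "n0",
     "geometry": [(104.9, 21.0), (105.0, 21.0)]},
    {"edge_id": "e1", "g_id": 1, "from_node": "n0", "to_node": "n2",
     "geometry": [(105.0, 21.0), (105.1, 21.0)]},
    {"edge_id": "e2", "g_id": 2, "from_node": "n0", "to_node": "n3",
     "geometry": [(105.0, 21.0), (105.0, 20.9)]},
]

print(check_network(nodes, edges))
print(node_degrees(edges)["n0"])
```

An empty problem list together with a degree of three at the junction node is what the note describes for a T-junction. The sketch does not test whether the graph is connected or whether edge geometries actually end at the node coordinates.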
Cost attributes¶
- Data to assign transport costs to network edges are stored:
  - In the file in path - /data/pre_processed_networks_data/mode_costs.xlsx
  - As Excel sheets
- All cost estimates should have the following attributes:
  - time_cost_usd - Float values of rate of time
  - tariff_min_vnd - Float values of minimum tariff rate in VND/ton-km (VND/ton for multi)
  - tariff_max_vnd - Float values of maximum tariff rate in VND/ton-km (VND/ton for multi)
  - tariff_min_usd - Float values of minimum tariff rate in USD/ton-km (USD/ton for multi)
  - tariff_max_usd - Float values of maximum tariff rate in USD/ton-km (USD/ton for multi)
  - attributes to decide how the costs are allocated to network edges (if none then all edges have same criteria)
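To illustrate how a per ton-km tariff rate translates into a cost on one edge, here is a minimal sketch based only on the units listed above. It is not the VTRA cost assignment: the function name edge_tariff_range_usd, the edge length, the tonnage and the tariff values are made-up illustrations, and the sketch applies only to rates given in USD/ton-km, not to the USD/ton rates used for multi.

```python
def edge_tariff_range_usd(cost_row, length_km, tons):
    """Return (min, max) tariff cost in USD for moving `tons` over `length_km`.

    `cost_row` is a dict with the tariff_min_usd and tariff_max_usd
    attributes, both interpreted as USD per ton-km.
    """
    if cost_row["tariff_min_usd"] > cost_row["tariff_max_usd"]:
        raise ValueError("tariff_min_usd exceeds tariff_max_usd")
    ton_km = length_km * tons
    return cost_row["tariff_min_usd"] * ton_km, cost_row["tariff_max_usd"] * ton_km


# Illustrative numbers only, not values from mode_costs.xlsx.
cost_row = {"tariff_min_usd": 0.25, "tariff_max_usd": 0.5}
low, high = edge_tariff_range_usd(cost_row, length_km=10.0, tons=20.0)
print(low, high)
```

Keeping the minimum and maximum together makes it easy to carry a cost range, rather than a single value, through later steps.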
Road design attributes¶
- Data to assign characteristics to roads are stored:
  - In the file in path - /data/pre_processed_networks_data/road_properties.xlsx
  - As Excel sheets
  - See /data/pre_processed_networks_data/road_properties.xlsx for data description
VITRANSS2 OD data¶
- VITRANSS2 province-level OD matrices are stored:
  - In the path - data/OD_data/
  - As Excel sheets
    - goods sheet gives OD values by commodity
    - modes sheet gives OD values by mode
- Aggregated goods-wise province-level national OD matrices have attributes:
  - o - Integer IDs of origin Provinces
  - d - Integer IDs of destination Provinces
  - name o - String names of origin Provinces
  - name d - String names of destination Provinces
  - commodity_names - Float values of daily tonnages of commodities/industries between OD Provinces
- Aggregated mode-wise province-level national OD matrices have attributes:
  - o - Integer IDs of origin Provinces
  - d - Integer IDs of destination Provinces
  - name o - String names of origin Provinces
  - name d - String names of destination Provinces
  - mode_names - Float values of daily tonnages along modes between OD Provinces
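Because each OD row carries four identifying columns followed by one tonnage column per commodity (or per mode), totals can be derived by summing every non-identifying column. The sketch below shows this on plain dictionaries standing in for rows of the Excel sheets. The helper names, the commodity column names (rice, cement) and the province names are hypothetical placeholders, not values from the VITRANSS2 data.

```python
# Identifying columns listed above; every other column is treated as a tonnage.
ID_COLUMNS = ("o", "d", "name o", "name d")


def total_daily_tonnage(row):
    """Sum the daily tonnage over all commodity (or mode) columns of one OD row."""
    return sum(value for key, value in row.items() if key not in ID_COLUMNS)


def outbound_by_origin(rows):
    """Total daily tonnage leaving each origin Province, keyed by the `o` ID."""
    totals = {}
    for row in rows:
        totals[row["o"]] = totals.get(row["o"], 0.0) + total_daily_tonnage(row)
    return totals


# Hypothetical rows with two made-up commodity columns.
od_rows = [
    {"o": 1, "d": 2, "name o": "Province A", "name d": "Province B", "rice": 10.0, "cement": 5.0},
    {"o": 1, "d": 3, "name o": "Province A", "name d": "Province C", "rice": 2.5, "cement": 0.0},
    {"o": 2, "d": 1, "name o": "Province B", "name d": "Province A", "rice": 0.0, "cement": 4.0},
]

print(total_daily_tonnage(od_rows[0]))
print(outbound_by_origin(od_rows))
```

The same helpers work for the mode-wise matrices, since they share the identifying columns and differ only in what the tonnage columns represent.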
IFPRI crop data¶
- IFPRI crop datasets are stored:
  - In the path - data/Agriculture_crops/
  - As GeoTiff files
  - Only files with names SPAM_P_crop name_ver3.tif are used
  - See Excel sheet in path data/Agriculture_crops//crop_data/crop_unit_costs.xlsx for costs of crops
- All crop GeoTiff datasets should have attributes:
  - values greater than 0
  - raster grid geometry
  - projection system: EPSG:4326 assumed by default
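Since only files following the SPAM_P_crop name_ver3.tif naming are used, a folder listing can be filtered by name before any raster is opened. The sketch below is one way to do that with a regular expression. It assumes the crop name is whatever text sits between the SPAM_P_ prefix and the _ver3.tif suffix, and the function name and sample file names are hypothetical.

```python
import re

# Assumed pattern: SPAM_P_<crop name>_ver3.tif, with <crop name> non-empty.
CROP_FILE_PATTERN = re.compile(r"^SPAM_P_(?P<crop>.+)_ver3\.tif$")


def crop_name_from_file(file_name):
    """Return the crop name embedded in a file name, or None if it does not match."""
    match = CROP_FILE_PATTERN.match(file_name)
    return match.group("crop") if match else None


# Hypothetical directory listing.
file_names = [
    "SPAM_P_RICE_ver3.tif",
    "SPAM_H_RICE_ver3.tif",   # different prefix, skipped
    "SPAM_P_MAIZ_ver3.tif",
    "readme.txt",
]

used = {name: crop_name_from_file(name) for name in file_names if crop_name_from_file(name)}
print(used)
```

The extracted crop name can then be used as a lookup key into the crop unit costs sheet, provided the names in that sheet follow the same spelling.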
RiceAtlas data¶
- RiceAtlas datasets are stored:
  - In the path - data/rice_atlas_vietnam/
  - As Shapefiles
  - Only the file rice_production.shp is used
- The essential attributes in the dataset are listed below. See the data for all attributes:
  - sub_region - String names of Provinces in English
  - P_Jan, …, P_Dec - Columns with float tonnage produced in each month from January to December
  - geometry - Polygon geometries of Provinces
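The monthly columns can be combined into an annual figure per Province by summing P_Jan through P_Dec. The sketch below does this on a plain dictionary standing in for one attribute row of rice_production.shp. It assumes the intermediate columns use three-letter English month abbreviations (P_Feb, P_Mar and so on), which the list above implies but does not spell out, and the function name, province name and tonnages are made up.

```python
# Assumed column names: P_ followed by a three-letter English month abbreviation.
MONTHS = ["Jan", "Feb", "Mar", "Apr", "May", "Jun",
          "Jul", "Aug", "Sep", "Oct", "Nov", "Dec"]
MONTH_COLUMNS = [f"P_{month}" for month in MONTHS]


def annual_production(row):
    """Sum the twelve monthly tonnage columns of one Province row."""
    return sum(row[column] for column in MONTH_COLUMNS)


# Hypothetical Province with harvests in two months only.
row = {"sub_region": "Province A", **{column: 0.0 for column in MONTH_COLUMNS}}
row["P_May"] = 1200.0
row["P_Oct"] = 800.0

print(row["sub_region"], annual_production(row))
```

A missing monthly column raises a KeyError here, which is a quick way to find out whether the column naming assumption holds for the actual file.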
Points of interest data¶
- Datasets with the locations of population points and of commune, district and province committee center points are stored:
  - In the path - data/Points_of_interest/
  - As Shapefiles
- The essential attributes in all the datasets are listed below. See the data for all attributes:
  - geometry - Point geometry with projection EPSG:4326