This page demonstrates the process of importing data from a BigQuery project into a graph in Ultipa.

The following steps are demonstrated using PowerShell (Windows).
Generate the Configuration File
Open the terminal program and navigate to the folder containing ultipa-importer
. Then, run the following command and select bigQuery
to generate a sample configuration file for BigQuery:
./ultipa-importer --sample

A file named import.sample.bigQuery.yml
will be generated in the same directory as ultipa-importer
. If the file already exists, it will be overwritten.
Modify the Configuration File
Customize the import.sample.bigQuery.yml
configuration file based on your specific requirements. It includes the following sections:
mode
: Set tobigQuery
.bigQuery
: Provide your BigQuery access details, including the project ID and the file path to the authentication certificate.server
: Provide your Ultipa server details and specify the target graph (new or existing) for data import.nodeConfig
: Define node schemas, where each schema corresponds to a query result. Columns in the query results are mapped to node properties.edgeConfig
: Define edge schemas, where each schema corresponds to a query result. Columns in the query results are mapped to edge properties.settings
: Set global import preferences and parameters.
# Mode options: csv/json/jsonl/rdf/graphml/bigQuery/sql/kafka/neo4j/salesforce
mode: bigQuery
# BigQuery access configurations
bigQuery:
# ID of the project where the dataset is stored
projectID: "transporter-demo-123"
# File path to the JSON key file for Service Account authentication in BigQuery
cert: "./transporter-demo-123-xxxxxx.json"
# Ultipa server configurations
server:
# Host IP/URI and port; if it's a cluster, separate multiple hosts with commas
host: "10.11.22.33:1234"
username: "admin"
password: "admin12345"
# The new or existing graph for data import
graphset: "myGraph"
# If the above graphset is new, specify the shards where the graph will be stored
shards: "1,2,3"
# If the above graphset is new, specify the partition function (Crc32/Crc64WE/Crc64XZ/CityHash64) used for sharding
partitionBy: "Crc32"
# Path of the certificate file for TLS encryption
crt: ""
# Node configurations
nodeConfig:
# Specify the schema
- schema: "Customer"
# The SQL query to retrieve data from a dataset in BigQuery
sql: "SELECT * FROM `trading.customer`"
# properties: Map SQL query results to properties; columns mapped to system properties such as _id, _from or _to in the SQL query results must be explicitly configured
## name: A column name in the SQL query results
## new_name: The property name; it defaults to the name above
## type: The property type; you can set to _id, _from, _to, or other Ultipa property value types like int64, float, string, etc; set to _ignore to skip importing the column
## prefix: Add a prefix to the values of a property; only apply to _id, _from, and _to
properties:
- name: cust_no
type: _id
prefix:
- name: name
type: string
new_name: cust_name
- name: level
type: int32
- schema: "Merchant"
sql: "SELECT * FROM `trading.merchant`"
properties:
- name: merch_no
type: _id
- name: name
type: string
- name: type
type: string
# Edge configurations
edgeConfig:
- schema: "Transfers"
sql: "SELECT * FROM `trading.transaction`"
properties:
- name: trans_no
type: string
- name: cust_no
type: _from
- name: merch_no
type: _to
- name: time
type: datetime
# Global settings
settings:
# Path of the log file
logPath: "./logs"
# Number of rows included in each insertion batch
batchSize: 10000
# Import mode: insert/overwrite/upsert
importMode: insert
# Automatically create missing end nodes for edges
createNodeIfNotExist: false
# Stops the importing process when error occurs
stopWhenError: false
# Set to true to automatically create new graph, schemas and properties
yes: true
# The maximum threads
threads: 32
# The maximum size (in MB) of each packet
maxPacketSize: 40
# Timezone for the timestamp values
# timeZone: "+0200"
# Timestamp value unit, support ms/s
timestampUnit: s
Execute Import
Execute the import by specifying the configuration file using the --config
flag:
./ultipa-importer --config import.sample.bigQuery.yml
