Data Joins
Data Joins in Analytical Data Sharing
Combine datasets from multiple sources to create unified, enriched data products. Data joins in Vendia help you build comprehensive, analytical datasets by merging tables from the same or different connected sources.
Important: Data joins require data to be ingested into Vendia tables first, then new combined tables can be created.
Join Capabilities
- Multiple Join Types: Support for INNER, LEFT, RIGHT, and FULL OUTER joins
- Cross-Source Joins: Join tables from the same or different connected sources
- Flexible Interface: Use either the intuitive UI wizard or SQL console for advanced joins
- Column Control: Select specific columns and apply aliases for clean output
How to Create Data Joins
Vendia provides an intuitive interface for joining tables using either a UI wizard or the SQL console. Follow these steps to combine data from multiple Vendia tables when creating a new Vendia table:
Step 1: Configure Your Join
Select Tables
- Left Table: Choose your primary table from the dropdown
- Right Table: Choose the table you want to join with
Choose Join Type
Select the appropriate join type for your use case:
- INNER JOIN: Returns only matching rows from both tables (most common)
- LEFT JOIN: Returns all rows from the left table and matching rows from the right table
- RIGHT JOIN: Returns all rows from the right table and matching rows from the left table
- FULL OUTER JOIN: Returns all rows when there’s a match in either table
Step 2: Define Join Conditions
Specify how rows should be matched between tables:
- Left Table Column: Select the column from your left table
- Operator: The join interface uses
=
for exact matches (for other operators, use the SQL console tab) - Right Table Column: Select the corresponding column from your right table
Example: Join customer tables on email hash
customer_data_a.email_hash = customer_data_b.email_hash
You can add multiple join conditions by clicking “Add Condition” for more complex joins.
Note: For advanced join conditions using operators other than =
(such as >
, <
, LIKE
, etc.), use the SQL console tab to write custom SQL queries.
Step 3: Select Output Columns
Choose which columns to include in your results:
- Return all columns: Include all columns from both tables (prefixed with table names)
- Select specific columns: Choose only the columns you need for your analysis
Step 4: Preview and Execute
- Review your join configuration
- Click “Run Query” to execute the join
- Preview the results to ensure they meet your expectations
- Save the joined table as a new Vendia table for sharing
Join Best Practices
- Consider data types: Ensure joining columns have compatible data types
- Start simple: Begin with basic joins and add complexity as needed
- Validate results: Always preview results to confirm the join logic is correct
Common Join Scenarios
Customer Data Enrichment
Join customer tables from different systems to create a unified customer view:
-- Example: Combine customer data from two different sourcesSELECT *FROM customers_system_a aINNER JOIN customers_system_b b ON a.email_hash = b.email_hash
Transaction Analysis
Combine transaction data with customer information:
-- Example: Join orders with customer demographicsSELECT c.customer_id, c.age_group, o.total_cost, o.order_dateFROM customer_data cINNER JOIN order_data o ON c.customer_id = o.customer_id
Regional Insights
Join data across different geographical regions:
-- Example: Combine sales data from different regionsSELECT r1.*, r2.*FROM region_us r1FULL OUTER JOIN region_eu r2 ON r1.product_id = r2.product_id
Example Use Cases
- Create tenant- or customer-specific datasets by joining customer data with transaction history
- Build comprehensive, unified datasets for analytics by joining data from multiple sources
- Enrich primary datasets with reference data or lookup tables
Learn More
- Data Transformations
- Getting Started with Analytical Data Sharing
- Analytical Data Sharing FAQ
- Contact Vendia Support for specific questions.