Support Multi-Tenancy

diggy · October 25, 2018, 4:43am

Moved from GitHub dgraph/2693

What you wanted to do

Create multiple DBs/schemas on the same server, as in traditional DBs (Postgres/MySQL/SQL Server). Useful for personal VMs and Raspberry Pi, where I can replace a traditional DB with Dgraph and run it on only one server.

What you actually did

Add prefix.

Why that wasn’t great, with examples

Cumbersome, because now I need to create variables for the prefix. If I accidentally drop the schema, I drop everything.

Any external references to support your case

CREATE DATABASE foodatabase or CREATE SCHEMA fooschema.
Would love to have something like localhost:8080/alter/foodatabase. If foodatabase is not provided it would default to existing behavior.

diggy · November 6, 2018, 12:54am

manishrjain commented :

The advantage of a graph DB is that multiple data sources can be combined together into one, and queried across. Given that benefit, having the division of a database is at best a low priority feature request.

diggy · December 24, 2018, 3:05pm

brianbroderick commented :

There are many reasons to have multiple databases; for example, it’s typical to have a dev, test, and prod environment with their respective databases. This makes it so the test database can be recreated before each test run. Right now, the only way to accomplish this is to either have multiple instances of Dgraph running, or to add a prefix to all predicates.

If I only want to clear test environment predicates, adding a prefix complicates operations like &api.Operation{DropAll: true}, which I run before any tests. It would also complicate Go structs when determining the right predicate values in JSON.
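To illustrate the prefix workaround described above, here is a minimal Go sketch. The helper names (prefixed, dropList) and the "env.predicate" naming convention are assumptions made for illustration; they are not part of Dgraph or its Go client. The sketch only shows why clearing one environment becomes a per-predicate chore: instead of a single DropAll, the caller has to enumerate every prefixed predicate and drop each one individually.

```go
package main

import (
	"fmt"
	"sort"
)

// prefixed returns a predicate name scoped to an environment,
// e.g. prefixed("test", "name") == "test.name".
// The "env.predicate" convention is hypothetical.
func prefixed(env, pred string) string {
	return env + "." + pred
}

// dropList returns, in sorted order, the prefixed predicate names that
// would have to be dropped one at a time to clear a single environment.
func dropList(env string, preds []string) []string {
	out := make([]string, 0, len(preds))
	for _, p := range preds {
		out = append(out, prefixed(env, p))
	}
	sort.Strings(out)
	return out
}

func main() {
	// Hypothetical application predicates.
	preds := []string{"name", "email", "friend"}
	for _, p := range dropList("test", preds) {
		fmt.Println("drop predicate:", p)
	}
}
```

Every struct tag and query in the application would need the same prefix applied, which is the complication with Go structs mentioned above.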

It’s also typical to work on many microservices at a time, but these microservices should not have any chance of their data colliding with each other; they should be completely isolated. It doesn’t seem realistic to have 10+ instances of Dgraph running at the same time on a laptop (5+ microservices, each with a dev and test environment).

Lastly, having multiple database support will help people transitioning from an RDBMS world to have an easier time making the switch.

diggy · December 31, 2018, 4:01pm

liqweed commented :

A major reason multiple databases are such an important factor for us is multitenancy. We intend to implement multitenancy with a database schema per account. That makes data isolation a lot easier (which includes removing an account, for example) without provisioning and maintaining thousands of database servers. Implementing multitenancy in dgraph as a schema per account is an even stronger case in my mind, since it lacks any mid-level namespace to segment the data (like tables in SQL/Cassandra or collections in MongoDB). That leaves very few options to go about segmenting the data effectively.

I agree with @brianbroderick - we implemented a microservices approach and one of the services is currently using dgraph. We avoid using dgraph for any other service since it would involve automation complexity which we find hard to justify. Had it been any easier to work with more than a single schema, dgraph usage would certainly proliferate in our case.

diggy · May 24, 2019, 2:13pm

romshark commented :

Support for multiple isolated databases on a single server would significantly increase my API tests’ execution speed. A run currently takes over 226 seconds, since all tests need to be executed serially! If I had the guarantee of isolated databases, I could set up a database instance for each test individually, allowing API tests to run in parallel. It’d theoretically be possible to go from 226s to under 10s (which is huge!)

I could do it myself with graph namespacing, but that’d be very error-prone since there are no isolation guarantees: one test could start mutating another test’s database, leading to a big mess.

I hope this feature will be implemented soon!

diggy · June 19, 2019, 5:14pm

aoighost commented :

This would be extremely useful for my use case too, which is using dgraph to support multiple workspaces. Also, the ability to reference nodes and create relationships across databases/workspaces would be useful as well.

diggy · June 19, 2019, 11:41pm

romshark commented :

@aoighost

Also, the ability to reference nodes and create relationships across databases/workspaces would be useful as well.

It would be the opposite of useful. If you have relationships across “databases” you have a single database. The multi-database feature is about isolation such that one database is physically isolated from another yet maintained by the same process for convenience.

diggy · June 25, 2019, 10:28pm

aoighost commented :

@romshark good point

diggy · July 13, 2019, 12:04am

campoy commented :

Whoa, this is a popular request!

OK, we’ll be working on this and seeing whether it can be part of our next release v1.2 expected to be released end of September.

diggy · August 1, 2019, 10:56pm

AgentZombie commented :

This might be a good place for the label field in n-quads.

diggy · August 5, 2019, 11:38pm

campoy commented :

Hi there @AgentZombie,

Could you explain what you mean by “the label field in n-quads”?

diggy · August 6, 2019, 1:18am

AgentZombie commented :

Sorry. I was speaking specifically about the graph label field in RDF n-quads. dgraph specifically reads RDF n-quads as a superset of n-triples but doesn’t use the fourth value to specify a named graph.

From RDF 1.1 N-Quads

The simplest statement is a sequence of (subject, predicate, object) terms forming an RDF triple and an optional blank node label or IRI labeling what graph in a dataset the triple belongs to, all are separated by whitespace and terminated by ‘.’ after each statement.

This was referenced here, #1143, and probably other places.

diggy · September 17, 2019, 4:21pm

campoy commented :

I wasn’t aware of that, and it does make sense to consider it as part of our support for multi-tenancy.

Thanks, @AgentZombie

diggy · September 24, 2019, 6:40am

Willem520 commented :

I think it will be a great feature

diggy · October 28, 2019, 2:24am

aoighost commented :

I’d appreciate it if this were not an enterprise feature. I’m trying to build an app that uses dgraph on the backend as a graph store, and multi-tenancy would make it a lot easier to build without having to spin up a new docker instance for each workspace. Enterprise only would kill the use of that feature for me. I should also note multi-tenancy would make it a lot easier for app developers to use dgraph in general, as it would make it easier to have multiple apps on one PC running dgraph for a backend.

diggy · October 30, 2019, 1:17pm

seanlaff commented :

We’re attempting to use one big dgraph instance to serve many discrete customers and need data isolation. This would be a great feature for us.

In the meantime we’ve been experimenting with putting a tenant predicate on every entity. However, I worry that this might have some performance drawbacks, since every query we send into dgraph has to be a tenant = x query, followed by a @filter of what the end user actually wanted.

From how I understand dgraph’s query planning, I think this means all my queries can only be as fast as that original tenant = x lookup (which hits millions of documents), right? (Since I always need to start at the tenant predicate and then filter)
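For readers unfamiliar with the pattern, this is roughly the query shape being described, sketched in Go. The predicate name tenant, the helper name tenantQuery, and the example filter are assumptions for illustration; the sketch also assumes the tenant predicate is indexed so it can be used in a root eq function. It says nothing about how dgraph actually executes the query, which is the open question above.

```go
package main

import "fmt"

// tenantQuery builds a query that starts at the tenant lookup and then
// applies the caller's filter, mirroring the "tenant = x, then @filter"
// shape. The filter is passed through as-is; real code should prefer
// query variables over string formatting for user-supplied values.
func tenantQuery(tenant, filter string) string {
	return fmt.Sprintf("{\n  q(func: eq(tenant, %q)) @filter(%s) {\n    uid\n  }\n}", tenant, filter)
}

func main() {
	// Hypothetical tenant ID and user filter.
	fmt.Println(tenantQuery("acme", `eq(name, "Alice")`))
}
```

Every query issued by the application has to go through a helper like this, so a single missed call site would leak data across tenants.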

diggy · November 9, 2019, 7:27am

hubyhuby commented :

We are further evaluating / prototyping the use of Dgraph.
For us, the minimal setup to evaluate Dgraph requires 3 environments, as per the regular dev pipeline: Development / Staging / Production.

Furthermore, the GDPR constrains my company in Europe to partition the data.
We need security by design at the organization level.
A database without Multi-Tenancy feature is a No Go for most companies in Europe.

Even an academic project in Europe cannot use the community edition if it uses some kind of personal data (as you cannot tell precisely or easily who can access the data).

As a core DB feature, I believe it should be part of the community edition.

diggy · January 3, 2020, 11:27pm

cosmotek commented :

Another upvote for this feature

diggy · January 9, 2020, 1:12am

ChStark commented :

Another upvote for this feature. It can also help Dgraph Labs launch their own Dgraph as a Service more easily.

diggy · January 29, 2020, 4:35pm

dvaldivia commented :

This feature was marked for the 1.2 milestone, but I don’t see it in the change log of the 1.2 release. Did this feature not make it?
