MongoDB Schema Design . can't get what i want

MongoDB Schema Design . can't get what i want - php

Well, this is a very relational problem, and using a non-relational database for such a problem requires some effort. In general, I think your schema design is good.
What you're describing is called "the N+1 problem", because you'll have to make N+1 queries for N objects (in your case, it's more complicated, but I guess you get the idea).
Some remedies:
You can use the $in operator to find e.g. all tracks of a certain artist:
db.tracks.find({"artists" : { $in : [artist_id_1, artist_id_2, ...] } });
This doesn't work if the array of artists grows huge, but a few hundred, maybe a thousand should work fine. Make sure artists is indexed.
You can denormalize some of the information that is needed very often. For example, you might want to show the track list very often, so it makes sense to copy the artist's names to every track. Denormalization depends mostly on what you're trying to achieve from an end-user perspective. You might not want to store each and every artist's name in full, but only the first 50 characters because the UI doesn't show more in the overview anyway.
In fact, you're already denormalizing some data, such as the artist ids in album (which are redundant, because you could get them via the tracks as well). This makes queries easier, but it will be more write-heavy. Updates are ugly because you'll have to make sure they propagate through the system.
In some cases, it might make sense to 'join' on the client(!) rather than the server. This doesn't really fit your problem well, but it's noteworthy: suppose you have a list of friends. Now the sever will have to look up each friend's name whenever it displays them. Instead, it could provide you with a lookup table ids/friends, and the server only serves the ids. Some JavaScript could replace the ids with the real names from the client's cache.

Related

How to stop array results from being merged when they share the same array key?

To build the array of posts, you need to append elements to the array of posts. Currently, you are just assigning a single element to the array over and over, which overwrites the previous value of the entire array.
The code to append posts:
$statuses['res_' . $row2['resolution_id']]['post'][] = ['post_title' => $row2['title'], 'post_description' => $row2['description'], 'categories' => $row2['category_names']];
Note the [] which I added to the end of the left side of the assignment operator.

Mongo $sum slow

I was holding off on answer, because I was sure that some MongoDB experts will answer. However as no one is giving answers, I will give few hints. Maybe something of that can help. But then again - I'm not a MongoDB expert. Take everything with small grain of salt.
1) Which version are you using? If you are still on 2.6 - try out 3.0.x (or newer) with WiredTiger engine.
2) If you have a lot of data sharding can greatly help. This will increase setup complexity, but as you will be able to process parts of data set in paralell, you can get significant speed gains. But be careful with choosing proper sharding key.
3) Consider creation of several collections which can act as smaller views. Example: if you currently have 15 fields in [..] there is great chance that lots of queries just use 1 or 2 at once. Like country. Create one more collection in which you use country data and skip rest. If query uses only country fields and not other of those 15, then use small collection. If query uses more fields, use big one. That way queries on countries will be much faster as you will be able to group data more. However not always this is possible as it adds extra complexity in building such small collections. If you process data in some queue (to insert in big), you could insert in small too. Or you could use some aggregate queries and $out to build smaller tables once every X minutes.
4) Come up with 3rd schema. Yours 2nd schema is easy to put data in, but its hard to get data out. You could use arrays more. That way it will be harder to get data in, but much more easy and faster to query it. Keep in mind that in your 2nd schema and in my sample for 3rd schema documents are growing and there can be need for MongoDB to move them around on disk and that is really slow operation. Test if that affects your setup. Small example of potential collection schema:
{
"user": "asd",
[...],
"date": ISODate("2015-07-01T00:00:00Z"), // first date of the month
"total": 2222,
"daily": [
{"date": ISODate("2015-07-01T00:00:00Z"), "total": 22},
{"date": ISODate("2015-07-11T00:00:00Z"), "total": 200},
{"date": ISODate("2015-07-20T00:00:00Z"), "total": 2000},
]
}
When inserting data you can use update with criteria (if you are in PHP): $criteria = ["user": "asd", "daily.date": new MongoDate("...."), // other fields] and update clause $update = ['$inc': ["total: 1, 'daily.$.total': 1]] . Check how many rows were updated. If 0, then create insert from the same data. I.e. unset $criteria['daily.date'] and change update to $update = ['$inc' => ['total' => 1], '$push' => ['daily' => ['date' => new MonoDate('..'), 'total': 1]]]. Keep in mind that you can run into problems if you have several scripts which insert data. Better do everything in queue by one. Or you do in parallel make sure that $push does not result in adding several daily.date with the same date. So - you try to update, if cant update, insert. As you use arrays and possitional operator, you can't use upserts. That's why there is extra insert needed. As I said, its more complicated to get data in. But it will be more easy to get data out. Make sure to set up proper indexes. For example on 'daily.date' etc. So that update queries would not need to check lots of documents. Even more - you can create some hash field to put [...] fields which would hold hash of all [...] fields. And use that in update. That way it will be much more easy to create small index to pinpoint particular document (you put in index 'daily.date', hash field and few more, but will not need to put 15 [..] fields).
When you have such structure you could do a lot of things with queries. For example - if you need full months, just query on date and [...] fields that you need, sum total and you are good. If you need some date range (like 1st - 10th of the month) you can query by [...] fields and date, project to get rid of unnecessary fields, $unwind daily, match again, but this time on daily.date field, then project to rename fields, then group and sum. It's much more flexible than use of $date.years.2015.months.07.days.03.total .
Keep in mind that all of those are just hints. Test everything on your own. And maybe 1 o 5 hints will work. But that can make all the difference.

PHP CodeIgniter - how to get good result array when relational table (there are same column name in different table)

I am trying to make pretty query result like doctrine, and other ORM
for example with relational table article and article_category.
i want to get query result like this :
Array
(
[0] => Array
(
[id] => 1
[title] => I am article title
[slug] => i-am-article-title
[category] => Array
(
[id] => 1
[name] => Category Name
[slug] => category-name
)
)
[1] => Array
(
[id] => 2
[title] => How to coding
[slug] => how-to-coding
[category] => Array
(
[id] => 4
[name] => Tutorial Area
[slug] => tutorial-area
)
)
)
i know this is basic, but i am want to know for create that result in very simple way.
thanks for all advice
UPDATED.
for to get that result, I am change using eloquent laravel framework.. . :)

No, you can't get this information in this way directly from your database if you are using a Relational Database like MySQL or PostgreSQL
You can get the effect you wish in two queries and insert the subquery array to your result array, or you can have a different table for your categories and do a JOIN with SQL.
As a note, other database systems return just what you asked, consider switching to MongoDB (a No-SQL solution) it returns an object just like you wished

MongoDB Array Search in Query or client side

I am wondering what is better to do. I have a pulled back a query like this:
Array
(
[_id] => MongoId Object
(
[$id] => 4eeedd9545c717620a000007
)
[field1] => ...
[field2] => ...
[field3] => ...
[field4] => ...
[field5] => ...
[field6] => ...
[votes] => Array
(
[whoVoted] => Array
(
[0] => 4f98930cb1445d0a7d000001
[1] => 4f98959cb1445d0a7d000002
[1] => 4f88730cb1445d0a7d000003
)
)
)
Which would be faster:
Pull that entire array in 1 query and use in_array() to find the right id?
Pull everything from the first query except the votes and then do another mongodb query to see if that id exist in the array?

It Depends on a lot of factors that I suggest you test but IMO most of the time it would be faster to just do 2 querys

Depends on the size of the array being returned / searched.
Also different servers are doing the work, what do you mean by faster? At what scale?

Find documents based on referenced ID in MongoDB & PHP

i've got referenced Users collection object in my MongoDB Items collection. Random Item document looks like this:
ps: to clarify, i really dont want to embed Items into Users collection.
Array
(
[_id] => MongoId Object
(
[$id] => 4d3c589378be56a008000000
)
[modified] => 1295800467
[order] => 1
[title] => MyFirstItem
[user] => Array
(
[$ref] => users
[$id] => MongoId Object
(
[$id] => 4d3c55e7a130717c09000012
)
)
)
So i need to find only items, which are assigned to the specific user. Find this question of my problem, but the solution didnt work for me.
MongoDB-PHP: JOIN-like query
Here is snippet of my code, givin' me no results at all.
$user = $db->users->findOne(array("_id" => new MongoID("4d3c55e7a130717c09000012")));
$items = $db->items->find(array("user" => array('$id' => $user["_id"])));
What is the correct way to finding that data? Should i instead put an user_id as a MongoID without reference?
Spent all my day with this, thanks in advance!

Try
$items = $db->items->find(array("user.$id" => $user["_id"]));

We Keep Coding

PHP, A popular general-purpose scripting language that is especially suited to web development.