I would like to contribute to the DS/ML and CS sections
I’m of the opinion that DS and ML should have separate sections [Provided we have enough volume for both], though - they are different enough, even if they’re often coupled together.
Can you elaborate a bit about those differences? Could be important… I would also vote for separated categories if they make sense. Collisions make the categorization a bit vague…
Note: Copied from my response in the Gitter chat room
I like the dichotomy between data science and “machine learning”/“statistical learning”. Although I believe that machine learning is a subset of what goes on within data science (which I believe is @evaristoc’s argument for keeping them under one tag), others outside have quite a distinction for those focusing on just machine learning algorithms (e.g. testing out algorithms on Kaggle) and all the rest of data science people don’t talk much about (e.g. data cleaning).
Although this separation is not fair on the other important topics of data science, I think people have interests in machine learning specifically that it may warrant its own tag. And I bet this is probably what @QuincyLarson was thinking when he listed them separately in his forum post proposal.
At the end of the day, I feel the bigger question may be what is the pros and cons for having one versus the other. One benefit for separating the two is to have a more focused tagging of forum posts, which I think would be machine learning and everything data science non-machine learning. However, now you have two tags, which are really just subsets of each other. So if you have one tag, it’ll be easier to classify everything as just one tag of “data science”. But then you have a problem of searching well if you go by tags. Eh, but I guess the search functionality of Discourse should be good enough to catch various search terms?
tl;dr I like splitting data science and machine learning into two because external forums/the internet I feel tends to lump into these two groups in terms of frequency of discussion
I also agree with @evaristoc that in theory Machine Learning is a subset of Data Science and working with data. As @erictleung pointed out, not everyone will agree on this, and many people would miss Machine Learning-related posts if they’re lumped in with Data Science posts. So I also think we should have separate categories for them.
This list assumes contributors will be responsible for helping nurture only a single category, though in practice many contributors will contribute posts and comments to threads in these categories.
I don’t know how granular you want to get with these categories, but maybe a category for React, which has it’s own problems. I was once in a forum that had a category “Noob Questions” which was nice because it was less threatening to beginners and kept those basic questions out of other categories. We also seem to get a lot of questions along the lines of “how do I learn” or “what’s the best path” - it might be nice to have a category for these, like “Learning”.
I think this the categorisation is a good idea, but I’m not sure currently how populated some of those will be (though simply adding them is probably the best way to find out). One thing I’ve noticed (and I’m not sure of a good way to categorise) is the questions relating to other languages - normally Java/C/C++, which appear far more frequently than data science/ML/security/testing questions - some other tag (other languages? That’s not very clear though) rather than the generic “help” might be good. Also I can’t remember seeing any gamedev/DevOps questions at all (maybe connected data point - prospective game Devs don’t seem to overlap much with prospective other programmers, community wise). I’d definitely be interested in adopting a category though.
+1 on React I think though (also Node?), particularly with it being the framework used for the beta, questions come up regularly enough for it to be discrete from the JS tag imo