this post was submitted on 31 Mar 2025
3 points (100.0% liked)

Machine Learning

1937 readers
1 users here now

founded 4 years ago
MODERATORS
 

Hello!

I did a map generator(it's pixel art and the largest are 300x200 pixels) some time ago and decided to generate 3 types of map sizes and 1500 maps for each size to train a model to practice and I thought to do that dataset open source.

Is that really something that people want/appreciate or not really? I'm a bit lost on how to proceed and what license to use. Does it make sense to use an MIT License? Or which one do you recommend?

thanks!

top 6 comments
sorted by: hot top controversial new old
[–] [email protected] 1 points 3 days ago

If anyone is interested I just published the repo :)

You can check it here: https://github.com/4Robato/Rogue-Fantasy-Maps-Dataset/tree/main

[–] [email protected] 2 points 3 days ago (2 children)

Sounds pretty neat. Licensing can be pretty complex but MIT is a pretty much no-frills license that let's them do with your dataset what they want. CC0 (public domain) is similar.

Alternatively you can also use something like CC-BY license which also let's people use it but it requires attribution.

A step beyond that is the CC-BY-SA which is similar but requires anything new created with the data to be licensed under the same license (share alike).

Just depends on what you want to do, and what you want people to do when they use your data. Id recommend MIT, CC0, or the CC-BY-4.0 license since these ensure the most people can use it if that's your goal

[–] [email protected] 2 points 3 days ago

That was very useful! I don't really care what people do but since I want to release a possible game that generates those maps I think I'll go by CC-BY like you said :)

Thanks for the info!

[–] [email protected] -1 points 3 days ago (1 children)

I don’t understand what you’re talking about and therefore I don’t know what you mean by “dataset” in this context, but I do know that generally speaking in the US, data isn’t copyrightable.

[–] [email protected] 2 points 3 days ago
  • The OP mentions the "dataset" is composed of maps they created and those works would be copyrightable if they wanted. Additionally the arrangement of the works and composition of the works in the dataset might also be copyrightable.
  • licensing extends beyond copyrights and clarifies terms of use to protect both creator and users even when copyrightability might be debatable in some jurisdictions.