(Note to users who are a bit confused: Dilaton has just made me an author on this blog, which is why I am able to post this.)
To have a good base of posts, we will likely be importing two sets of data from the Stack Exchange data dumps:
- The entire Theoretical Physics Stack Exchange (TP.SE) data dump
- Part of the Physics Stack Exchange (Phys.SE) data dump
There are only 413 questions from TP.SE, whereas I estimate that there are around 4336 possibly interesting questions from Physics.SE.
To attribute posts from Physics.SE, however, we need to provide:
- The link to the question on Physics.SE
- The link to the OP’s user profile on Physics.SE
- A note indicating that the question is from Physics.SE
At least that’s how we are told to interpret CC BY-SA.
I also think it is important to credit the editors (not for legal reasons, just to acknowledge their effort), but without linking to their profiles.
Now, I ran a query on Data Stack Exchange (Data.SE), and it returned 12054 interesting questions.
However, luckily for us, that figure involves a lot of double-counting, since a question with several tags can be counted more than once. Using an existing query, it appears that each question has an average of approximately 2.78 tags, so I estimate that we will be importing around 12054 / 2.78 ≈ 4336 interesting questions from Phys.SE.
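For anyone who wants to check the arithmetic, here is a minimal sketch of the estimate in Python. The variable names are my own, and it assumes that dividing the raw count by the average number of tags per question is a fair correction for the double-counting, which is only a rough approximation.

```python
# Rough estimate of the number of distinct interesting questions.
# Assumption: each question is counted about once per tag it carries.
raw_hits = 12054            # count returned by the Data.SE query
avg_tags_per_question = 2.78  # average taken from an existing query

estimated_questions = round(raw_hits / avg_tags_per_question)
print(estimated_questions)
```

If the average tag count for the interesting questions differs from the site-wide average, the real number could be somewhat higher or lower.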
Now, I think it is totally impractical for us to go about manually tagging all the posts with the right attributions.
So, we’d need a script to help us tag all imported questions as such.
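To make the idea concrete, here is a rough sketch of what the attribution part of such a script might look like. Everything in it is an assumption for illustration: the function name `attribution_note`, its parameters, the wording of the note, and the short URL forms built from numeric IDs. How the note is actually attached to a post will depend on the import tool we end up using.

```python
# Hypothetical helper: builds the attribution text for one imported question.
# The base URL and the ID-based link format are assumptions.
PHYS_SE = "https://physics.stackexchange.com"


def attribution_note(question_id, owner_id, owner_name, editor_names=()):
    """Return a plain-text attribution footer for a Phys.SE question."""
    lines = [
        "This question was imported from Physics Stack Exchange (Phys.SE).",
        f"Original question: {PHYS_SE}/questions/{question_id}",
        f"Asked by {owner_name}: {PHYS_SE}/users/{owner_id}",
    ]
    if editor_names:
        # Editors are credited by name only, without profile links.
        lines.append("Edited by: " + ", ".join(editor_names))
    return "\n".join(lines)


# Example with made-up IDs and names.
note = attribution_note(123, 45, "Alice", ["Bob", "Carol"])
print(note)
```

The three required items (question link, OP profile link, source note) are always present, while the editor credit is added only when there are editors to name.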
I think that the next important post should discuss the different settings to enable on the site's Admin Dashboard.