question

rickross avatar image
rickross asked

[meta] Specifics of the SQLTeam data merge

We're finally about to merge the data from into Ask.SSC. There has been some good discussion of this before, but now we're getting down to specifics. We could use your help to make sure the bases are well-covered. First, there is an issue of different tags being used to represent basically the same thing. We have uploaded CSV files of all the tags from both SQLTeam and CCS to so nobody needs to endure the aggravation of cutting and pasting pages of web data into a spreadsheet. What we'd like (and we're not exactly sure the optimal way to do it) is to have you guys decide which tags from SQLTeam you would like remapped to existing tags in SSC. We really just need a file that has the tag pairs to show what target tag to use for any original tag that you want to be mapped. If there is no mapping for a given tag, then we'll just preserve the original tag. Second, and significantly more complex, is the question of correctly combining users who have accounts in both communities. Here are some statistics: 1. There are 2664 users in SSC and 1515 in SQLTeam. 2. There are 119 users with the exact same username. 3. 115 users in both sites have the same email. 4. Of these users, 57 also have the same username. 5. There are 29 users that share some authentication key. 6. Of these, 21 have the same username on both sides. 7. 23 Users share both email and authentication key. 8. 19 share it all: auth key, email and username. So, you can see that the overlap is, at best, unclear and doesn't cover most users. We have a strategy in mind to produce a reasonable combination of the user datasets, and we think we have a good idea about how to let users combine accounts in a "self-service" mode for those who end up with multiple accounts. Hernani will discuss the merge strategy more below. Finally, there may be other important issues about merging that we have not considered, so let's get them all on the table in hopes of getting to the most useful and successful outcome. Thanks!
meta-asksscmergesqlteam
12 comments
10 |1200

Up to 2 attachments (including images) can be used with a maximum of 512.0 KiB each and 1.0 MiB total.

Kev Riley avatar image Kev Riley ♦♦ commented ·
If everyone's ok with it, I'm happy to go through the tags and email back to Rick - rather than lots of people give lots of different views...?
3 Likes 3 ·
Grant Fritchey avatar image Grant Fritchey ♦♦ commented ·
Go for it. If we don't like it, we'll scream really loudly. You'll hear it.
3 Likes 3 ·
Matt Whitfield avatar image Matt Whitfield ♦♦ commented ·
@rickross - Well, I'm not sure what I fall into, but I know that my accounts were the same - but now that ask.sqlteam is on OSQA - I can't login to check. I'm pretty sure I had to change my openID in order to be able to use Ask.SSC after the move - so do your matching stats account for that?
1 Like 1 ·
rickross avatar image rickross ♦♦ commented ·
@Matt, the various OpenID providers have differing interpretations of what hashkeys will work when hostnames and urls change. Google is the most widely used and also the most strict, whereas MyOpenID is more lenient. Since the hostname has not actually changed, it shouldn't have been a problem to continue to login on the new OSQA-powered version with the same OpenID that was used before. You personally have the same auth credential on both sites, so we do not anticipate a problem. In general, if there is not an exact match on the auth credentials, there is no match at all. These hashkeys for OpenID are very long and complex. There's no way for us to guess that two different hashkeys actually belong to the same user. PS - The only change in your OpenID from the original SE data dump was a switch from *http:* to *https:* (which seems more appropriate for a secure login credential, anyway.)
1 Like 1 ·
hernani avatar image hernani commented ·
@Matt, I believe you fall in #8.
0 Likes 0 ·
Fatherjack avatar image Fatherjack ♦♦ commented ·
@ everyone. should we take this offline rather than gum up the forum. I'm happy to share email. @Kev yup, you doing all the work is fine by me ;) seriously though, do a bunch and then pass it on and I'll finish/do some and pass it on etc...
0 Likes 0 ·
hernani avatar image hernani commented ·
It's my belief that there aren't that many tags that need to be merged, except for the sql2005 vs sql-server-2005, and similar. In sqlteam, only the few 10 tags or so have some relevant usage.
0 Likes 0 ·
ThomasRushton avatar image ThomasRushton ♦♦ commented ·
@Kev - if you need some help, I'm not working this week. Well, not *work* work - "just" childcare...
0 Likes 0 ·
Mark avatar image Mark commented ·
I got this error a few minutes ago: "Caught NameError while rendering: global name 'now' is not defined." If you want more specifics, let me know, I have a whole page of them. The errors and code were exposed.
0 Likes 0 ·
Melvyn Harbour avatar image Melvyn Harbour commented ·
@Mark, yes, we are having a problem with a section loading very slowly, and it is only here, and that was caused by me trying to debug it.
0 Likes 0 ·
ThomasRushton avatar image ThomasRushton ♦♦ commented ·
Ah. Were you also responsible for the various "span" messages I saw on the site a few minutes back?
0 Likes 0 ·
hernani avatar image hernani commented ·
@Thomas, yes :) The problem was bugging me cause I couldn't reproduce it anywhere else except in this particular site. And btw, the previous comment was not Melvyn Harbour, it was me, I was just logged in as him cause he's the site owner and has special privileges.
0 Likes 0 ·
hernani avatar image
hernani answered
I would like to give my suggestions regarding tags. I believe that the ones to be preserved are the ones from ask.sqlservercentral. So besides sql200X to sql-server-200X, the most obvious ones are stored-procedure to stored-procedures and fulltextsearch to full-text.
2 comments
10 |1200

Up to 2 attachments (including images) can be used with a maximum of 512.0 KiB each and 1.0 MiB total.

Matt Whitfield avatar image Matt Whitfield ♦♦ commented ·
@hernani - I think Kev Riley is coming up with a full mapping for you...
0 Likes 0 ·
hernani avatar image hernani commented ·
Ok, thanks.
0 Likes 0 ·
hernani avatar image
hernani answered
We've put a test site with a full merge and very recent data here: Please give your feedback and if there is anything that should be fixed, prior to the definitive merging. EDIT: Please upvote or downvote this answer if you agree/disagree that the merge can proceed.
23 comments
10 |1200

Up to 2 attachments (including images) can be used with a maximum of 512.0 KiB each and 1.0 MiB total.

Mark avatar image Mark commented ·
Hmm... snappy. I like it.
0 Likes 0 ·
ThomasRushton avatar image ThomasRushton ♦♦ commented ·
Login issues. Tried giving it my Google login ID, and it asked me to provide screen name. Gave it my usual screen name, and it's (surprise) already in use... but it won't let me in with that name...
0 Likes 0 ·
hernani avatar image hernani commented ·
@Thomas, forgot to mention that google and yahoo logins won't work, because it's running on a different domain.
0 Likes 0 ·
Matt Whitfield avatar image Matt Whitfield ♦♦ commented ·
@hernani I have a problem...
0 Likes 0 ·
Matt Whitfield avatar image Matt Whitfield ♦♦ commented ·
Check out and then
0 Likes 0 ·
Show more comments

Write an Answer

Hint: Notify or tag a user in this post by typing @username.

Up to 2 attachments (including images) can be used with a maximum of 512.0 KiB each and 1.0 MiB total.