Hi, I am writing a Python script that gets articles from a MySQL DB, splits each article into its words, gets all synonyms for each word, and saves them back to the MySQL server.
The problem now is that there are duplicate synonyms for each article.
I am using three tables:
articles with aID, article
synonyms with sID, synonym
and a correlation table corr with aID, sID
Anyone got an idea how to remove the duplicates from the tables synonyms and corr, or a tip on how to avoid them in the first place?
Name:
Anonymous 2012-11-20 11:29
Try a ranked query with row_number to pick which duplicate to keep. Then, to avoid it in future, you could use a unique index. But why am I helping you with this?
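For anyone who wants to see the ranked-query idea spelled out, here is a rough sketch, not a definitive fix. It assumes the three-table layout from the first post and uses Python's built-in sqlite3 as a stand-in for MySQL so it runs without a server. The helper names (build_demo, dedupe_synonyms), the temp table synonym_rank and the sample rows are made up for illustration. ROW_NUMBER needs a database with window functions (SQLite 3.25+, MySQL 8.0+), and rowid is SQLite-specific, so that last DELETE would need adapting on MySQL.

```python
import sqlite3


def build_demo(conn):
    """Create the synonyms and corr tables with some duplicated sample rows."""
    conn.executescript("""
        CREATE TABLE synonyms (sID INTEGER PRIMARY KEY, synonym TEXT NOT NULL);
        CREATE TABLE corr (aID INTEGER NOT NULL, sID INTEGER NOT NULL);
        INSERT INTO synonyms (sID, synonym) VALUES
            (1, 'fast'), (2, 'quick'), (3, 'fast'), (4, 'fast');
        INSERT INTO corr (aID, sID) VALUES
            (1, 1), (1, 3), (1, 2), (2, 4), (2, 3);
    """)


def dedupe_synonyms(conn):
    """Keep the lowest sID per synonym and repoint corr at the kept rows."""
    cur = conn.cursor()
    # Rank rows within each synonym text; rn = 1 marks the row to keep.
    cur.execute("""
        CREATE TEMP TABLE synonym_rank AS
        SELECT sID,
               ROW_NUMBER() OVER (PARTITION BY synonym ORDER BY sID) AS rn,
               MIN(sID) OVER (PARTITION BY synonym) AS keep_id
        FROM synonyms
    """)
    # Repoint correlation rows that reference a duplicate synonym.
    cur.execute("""
        UPDATE corr
        SET sID = (SELECT keep_id FROM synonym_rank
                   WHERE synonym_rank.sID = corr.sID)
        WHERE sID IN (SELECT sID FROM synonym_rank WHERE rn > 1)
    """)
    # Drop the duplicate synonym rows themselves.
    cur.execute("""
        DELETE FROM synonyms
        WHERE sID IN (SELECT sID FROM synonym_rank WHERE rn > 1)
    """)
    # Repointing can leave identical (aID, sID) pairs; keep one of each.
    cur.execute("""
        DELETE FROM corr
        WHERE rowid NOT IN (SELECT MIN(rowid) FROM corr GROUP BY aID, sID)
    """)
    cur.execute("DROP TABLE synonym_rank")
    conn.commit()


if __name__ == "__main__":
    demo = sqlite3.connect(":memory:")
    build_demo(demo)
    dedupe_synonyms(demo)
    print(demo.execute("SELECT sID, synonym FROM synonyms ORDER BY sID").fetchall())
    print(demo.execute("SELECT aID, sID FROM corr ORDER BY aID, sID").fetchall())
```

The order matters: corr gets repointed before the duplicate synonyms are deleted, otherwise the correlation rows would be left pointing at sIDs that no longer exist.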
Name:
Anonymous 2012-11-20 11:43
>>2
thanks for your advice, I think I will go with the unique index
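A minimal sketch of the unique-index route, under the same assumptions as the table layout in the first post. It uses Python's sqlite3 so it runs anywhere. The index names (ux_synonym, ux_corr) and the helpers make_schema and link_synonym are hypothetical. INSERT OR IGNORE is SQLite syntax; MySQL spells it INSERT IGNORE.

```python
import sqlite3


def make_schema(conn):
    """Create synonyms and corr with unique indexes that reject duplicates."""
    conn.executescript("""
        CREATE TABLE synonyms (sID INTEGER PRIMARY KEY, synonym TEXT NOT NULL);
        CREATE UNIQUE INDEX ux_synonym ON synonyms (synonym);
        CREATE TABLE corr (aID INTEGER NOT NULL, sID INTEGER NOT NULL);
        CREATE UNIQUE INDEX ux_corr ON corr (aID, sID);
    """)


def link_synonym(conn, article_id, word):
    """Store word once in synonyms and link it to article_id once in corr."""
    # The unique index turns a repeated insert into a no-op instead of a new row.
    conn.execute("INSERT OR IGNORE INTO synonyms (synonym) VALUES (?)", (word,))
    (sid,) = conn.execute(
        "SELECT sID FROM synonyms WHERE synonym = ?", (word,)
    ).fetchone()
    conn.execute(
        "INSERT OR IGNORE INTO corr (aID, sID) VALUES (?, ?)", (article_id, sid)
    )
    return sid


if __name__ == "__main__":
    demo = sqlite3.connect(":memory:")
    make_schema(demo)
    for article_id, word in [(1, "fast"), (1, "fast"), (2, "fast")]:
        link_synonym(demo, article_id, word)
    print(demo.execute("SELECT COUNT(*) FROM synonyms").fetchone())
    print(demo.execute("SELECT COUNT(*) FROM corr").fetchone())
```

Note that a unique index cannot be created while duplicates are still in the table, so the existing rows have to be cleaned up first.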
>>4
Take your edgy cat-v bandwagoning back to Reddit, Uriel.
There's a branch of mathematics behind relational database theory called relational algebra, and it's actually one of the more logically intensive fields of computer science, so I can understand why you code monkeys can't grasp it or harness its true power. Yes, mysqld to power your blog is overkill, but sqlite3 is extremely light, efficient and expressive. Suggesting flat files as an alternative is like suggesting bubble sort as an alternative to quick sort.
>>6
and good riddance!
plaintext considered retarded.