categorization

Categorizing Programming Languages

I've worked among others with Java, Prolog, SQL and C# languages so far. I know that Java and C# are imperative, Prolog declarative and SQL somewhat declarative languages. Java and C# are object-oriented and you can develop aspect-oriented programs with them. That's already four different characteristics that certain languages might fit...

How does music fingerprinting work (for sites such as Shazam and Lala.com)?

My large (120gb) music collection contains many duplicate songs, and I've been trying to fingerprint tracks in the hopes of detecting duplicates. And since I'm a CS Major I'm very curious as to what is done out there? Nothing I do has nearly the accuracy of something like Shazam or Lala.com. How do they "hash" tracks? I have run a standa...

Designing a SQL table with hierarchy/sub-categories

Hey everyone, I have a table that looks something like this: ID | Keyword | Category | Sub-Category | Sub-Sub-Category | Sub-Sub-Sub-Category Do i need to split it up in two tables (a keyword table and a categories table with parent id) if one keyword can only belong to one category,sub-category...etc. m...

MYSQL: returning zero when no value is present, categorized monthly

I have the following two tables that record expenditure and provide expenditure category information: Table transactions: +-------+--------+--------+ | month | cat_id | amount | +-------+--------+--------+ | 1 | 2 | 3 | | 1 | 2 | 8 | | 2 | 1 | 7 | | 2 | 1 | 5 | +-------+--------+-...

What would you call "callback" or "closure" in general?

Which keyword / tag should we use to describe them when we're taking notes for them. If they are placed under the category "software design", is it appropriate? @edit: It's more about how you category things. When you are in learning, some terminology appears, so what category will you assign it in your own ontology? ...

How to sort websites into categories based on keyword content

I'm writing a webrobot which categorizes sites based on there keyword/meta/links into a predefined list of categories. I've been looking at various ontology approaches and have looked at Wordnet (for the hypernym/hyponym), ResearchCyc , WebKb and was wondering if this was as hard a problem as I'm thinking or has it been solved somewhere...

Categorising a large mysql result with php while?

I'm using the following to grab my large result set from a mysql db: $discresult = 'SELECT t.id, t.subject, t.topicimage, t.topictype, c.user_id, c.disc_id FROM topics AS t LEFT JOIN collections AS c ON t.id=c.disc_id WHERE c.user_id='.$user_id; $userdiscs = $db->query($discresult) or error('Error.', __FILE__, __LINE__, $db->error()); ...

Standards for taxonomy terms or categories

Every time I build an ecommerce application I need to invest a considerable amount of time into categorising products. Factors such as ambiguoty, duplication and subjectivity can often lead to rework and wasted time. Are there any standards available (such as ISOs) that can be applied to categorising products like computer or car parts ...

categorize websites - open source LSI?

Im looking to categorize lots of websites (millions). I can use Nutch to crawl them and get the content of the sites, but I am looking for the best (and cheapest or free) tool to categorize them. One option is to create regular expressions that look for certain keywords and categorize the sites, but there area also high end LSI type too...

Ticket Bayesian(or something else) Categorization

Hi. I search solution for ticket managment system. Do you know any commercial offers? For now I have only own dev prjects with using dspam library. Maybe I am wrong use it but it show bad results. My idea was divide all prerated ticket in 2 group: spam (it is my category) and rest to (ham - all not the same with this category). After...

Is integration testing an umbrella term and if so, what types of tests does it include?

I find the concept of 'integration testing' confusing. There seems to be quite a few explanations and scopes: Functional/acceptance testing (e.g. testing the user interface with for example, Selenium) Testing the integration of different classes/modules of software together (simply testing two or more classes together, without them doi...

Categories of tags

I'm starting a pro bono project that is the web interface to the world's largest collection of lute music and it's a challenging collection from several points of view. The pieces are largely from 1400 to 1600, but they range from the mid-1200's to present day. Needless to say, there is tremendous variability in how the pieces are catego...

Modeling question about categorization. To subtype or not to?

I need some advice on how to model this simple categorization (?) example: I have a product. A product can be of different types, such as ProductType 1, ProductType 2, and ProductType 3. All products have a part number and a name. Where they differ, is the way their prices are calculated. Products in type 1's price are dependent on how...

R - How to convert a factor to an integer\numeric in R without a loss of information

When I convert a factor to a numeric, the values change to rank values. R> m$obs [1] 0 0 1 1 1 1 3 3 3 3 3 3 3 9 9 9 9 9 9 9 9 9 11 11 12 13 13 13 13 13 13 13 14 Levels: 0 1 3 9 11 12 13 14 R> as.numeric(m$obs) [1] 1 1 2 2 2 2 3 3 3 3 3 3 3 4 4 4 4 4 4 4 4 4 5 5 6 7 7 7 7 7 7 7 8 I have to resort to paste() ...

Full text search in reverse mode.

We are trying to build the categorization tool which should have a keyword match. Each category has it's keywords. Each asset may belongs to many categories. In order to categorize asset we need to find all categories that match the asset: all keywords defined for a category should match(full text search) the asset. Is there any way to...

Database Structure for a Faceted Search

I am creating an eCommerce site that needs to have a faceted search tool so that customers can narrow down a product search via categories and classifications in the same style as ebuyer.com and Newegg.com (See left hand menus). I initially dived straight into designing a database that ended up similar to an EAV structure (I didn't know...