ansaurus

Question

Answer 1

A:

Python, O(n^2)

import sys;
words=sys.stdin.readlines()
def s(x):return sorted(x.lower());
print '\n'.join([''.join([a.replace('\n',' ') for a in words if(s(a)==s(w))]) for w in words])

codewarrior 2010-04-02 10:01:51

This is... incredibly slow. :P It also outputs each set of anagrams multiple times, equal to the number of anagrams in the set.

Amber 2010-04-02 10:17:48

(Oh, and it also outputs individual words which aren't anagrams on their own line, which doesn't appear to be part of the specified output.)

Amber 2010-04-02 10:29:47

Why ; at end of line, that is not Python!

Tony Veijalainen 2010-10-04 20:43:47

Answer 2

+2 A:

Python, 167 characters, includes I/O

import sys
d={}
for l in sys.stdin.readlines():
 l=l[:-1]
 k=''.join(sorted(l)).lower()
 d[k]=d.pop(k,[])+[l]
for k in d:
 if len(d[k])>1: print(' '.join(d[k]))

Without the input code (i.e. if we assume the wordlist already in a list w), it's only 134 characters:

d={}
for l in w:
 l=l[:-1]
 k=''.join(lower(sorted(l)))
 d[k]=d.pop(k,[])+[l]
for k in d:
 if len(d[k])>1: print(' '.join(d[k]))

Amber 2010-04-02 10:03:22

Remove the space between `:` and `print`, and use semicolons. I think you could use `input()`.

KennyTM 2010-04-02 12:56:15

This produces case-sensitive results. Compare against Dan Andreatta or my solutions.

Mark Rushakoff 2010-04-02 14:25:11

Fixed, now that case-insensitivity was specified as required.

Amber 2010-04-02 19:24:32

Python 2.6.1 on my machine claims: "NameError: name 'lower' is not defined"

MtnViewMark 2010-04-03 05:52:01

Yeah, that was a mistake on my part, I fixed it (should be `.lower()` instead of `lower(...)`).

Amber 2010-04-03 09:04:16

Answer 3

+1 A:

AWK - 119

{split(toupper($1),a,"");asort(a);s="";for(i=1;a[i];)s=a[i++]s;x[s]=x[s]$1" "}
END{for(i in x)if(x[i]~/ .* /)print x[i]}

AWK does not have a join function like Python, or it could have been shorter...

~~It assumes uppercase and lowercase as different.~~

Dan Andreatta 2010-04-02 10:33:55

Will this only print out words which are anagrams? Or will it also print out words with no other anagram on their own lines?

Amber 2010-04-02 10:37:42

The original version prints everything. Update with the fix.

Dan Andreatta 2010-04-02 10:42:15

I think this must be for Gnu awk (gawk) only. Standard awk doesn't have the function asort.

MtnViewMark 2010-04-03 05:59:17

@MtnVieweMark: you are correct, this is for `gawk`

Dan Andreatta 2010-04-03 13:22:44

Answer 4

+10 A:

Powershell, 104 97 91 86 83 chars

$k=@{};$input|%{$k["$([char[]]$_|%{$_+0}|sort)"]+=@($_)}
$k.Values|?{$_[1]}|%{"$_"}

Update for the new requirement (+8 chars):

To exclude the words that only differ in capitalization, we could just remove the duplicates (case-insensitvely) from the input list, i.e. $input|sort -u where -u stands for -unique. sort is case-insenstive by default:

$k=@{};$input|sort -u|%{$k["$([char[]]$_|%{$_+0}|sort)"]+=@($_)} 
$k.Values|?{$_[1]}|%{"$_"}

Explanation of the `[char[]]$_|%{$_+0}|sort` -part

It's a key for the hashtable entry under which anagrams of a word are stored. My initial solution was: $_.ToLower().ToCharArray()|sort. Then I discovered I didn't need ToLower() for the key, as hashtable lookups are case-insensitive.

[char[]]$_|sort would be ideal, but sorting of the chars for the key needs to be case-insensitive (otherwise Cab and abc would be stored under different keys). Unfortunately, sort is not case-insenstive for chars (only for strings).

What we need is [string[]][char[]]$_|sort, but I found a shorter way of converting each char to string, which is to concat something else to it, in this case an integer 0, hence [char[]]$_|%{$_+0}|sort. This doesn't affect the sorting order, and the actual key ends up being something like: d0 o0 r0 w0. It's not pretty, but it does the job :)

Danko Durbić 2010-04-02 12:22:09

This Powershell is starting to freak me out ;)

Dan Andreatta 2010-04-02 13:48:21

Looks like perl.

Alex 2010-04-03 09:06:29

I'm accepting this because 1. it conforms to all the requirements 2. it's the highest voted answer, and 3. I haven't seen much power shell code golfing before :)

Charles Ma 2010-04-04 09:28:00

Answer 5

+3 A:

Ruby, 94 characters

h={};(h[$_.upcase.bytes.sort]||=[])<<$_ while gets&&chomp;h.each{|k,v|puts v.join' 'if v.at 1}

Mark Rushakoff 2010-04-02 13:33:48

This is a pretty Perl-ish approach, so I expect that somebody will be able to knock off at least 5-10 characters with a Perl solution.

Mark Rushakoff 2010-04-02 14:27:41

Answer 6

+4 A:

Haskell, 147 chars

prior sizes: ~~150~~ ~~159~~ chars

import Char
import List
x=sort.map toLower
g&a=g(x a).x
main=interact$unlines.map unwords.filter((>1).length).groupBy((==)&).sortBy(compare&).lines

This version, at 165 chars satisifies the new, clarified rules:

import Char
import List
y=map toLower
x=sort.y
g&f=(.f).g.f
w[_]="";w a=show a++"\n"
main=interact$concatMap(w.nubBy((==)&y)).groupBy((==)&x).sortBy(compare&x).lines

This version handles:

Words in the input that differ only by case should only count as one word
The output needs to be one anagram set per line, but extra punctuation is acceptable

MtnViewMark 2010-04-02 14:20:35

Just my first attempt -- I'm sure it can be squeezed.

MtnViewMark 2010-04-02 14:21:15

It's so nice to see that something close to idiomatic, readable Haskell also scores pretty well in code golf!

Thomas 2010-04-03 11:31:18

Agreed, this is probably my favorite answer :)

Charles Ma 2010-04-04 09:30:04

Answer 7

A:

C++, 542 chars

#include <iostream>
#include <map>
#include <vector>
#include <boost/algorithm/string.hpp>
#define ci const_iterator
int main(){using namespace std;typedef string s;typedef vector<s> vs;vs l;
copy(istream_iterator<s>(cin),istream_iterator<s>(),back_inserter(l));map<s, vs> r;
for (vs::ci i=l.begin(),e=l.end();i!=e;++i){s a=boost::to_lower_copy(*i);
sort(a.begin(),a.end());r[a].push_back(*i);}for (map<s,vs>::ci i=r.begin(),e=r.end();
i!=e;++i)if(i->second.size()>1)*copy(i->second.begin(),i->second.end(),
ostream_iterator<s>(cout," "))="\n";}

TC 2010-04-02 14:57:48

Notes: (1) Actual count as shown is a little higher because I threw in some newlines for readability (main() can be on a single line) (2) Slightly compressed yet readable version: http://codepad.org/JtB1AHrE

TC 2010-04-02 15:00:13

Answer 8

+10 A:

Perl, 59 characters

chop,$_{join'',sort split//,lc}.="$_ "for<>;/ ./&&say for%_

Note that this requires Perl 5.10 (for the say function).

Michael Carman 2010-04-02 16:28:49

MtnViewMark 2010-04-03 05:54:52

@MtnViewMark: To protect backward compatibility the features added to Perl in version 5.10 must be explicitly enabled: `perl -M5.010 ana-perl.pl < wordlist` How that should effect golf scores is debatable.

Michael Carman 2010-04-04 18:33:46

ansaurus

tags:

views:

answers:

Code golf: find all anagrams

Task:

Input:

Output:

EDIT: Clarifications

EDIT2: More Clarifications

Python, O(n^2)

Python, 167 characters, includes I/O

AWK - 119

Powershell, 104 97 91 86 83 chars

Update for the new requirement (+8 chars):

Explanation of the `[char[]]$_|%{$_+0}|sort` -part

Ruby, 94 characters

Haskell, 147 chars

C++, 542 chars

Perl, 59 characters

related questions

ansaurus

tags:

views:

answers:

Code golf: find all anagrams

Task:

Input:

Output:

EDIT: Clarifications

EDIT2: More Clarifications

Python, O(n^2)

Python, 167 characters, includes I/O

AWK - 119

Powershell, 104 97 91 86 83 chars

Update for the new requirement (+8 chars):

Explanation of the [char[]]$_|%{$_+0}|sort -part

Ruby, 94 characters

Haskell, 147 chars

C++, 542 chars

Perl, 59 characters

related questions

Explanation of the `[char[]]$_|%{$_+0}|sort` -part