ansaurus

Question

C program to remove repeated char from a string

Answer 1

+10 A:

when str[i] is a non-alphabet, say a space and when you do:

hash[str[i] - 'a']

your program can blow.

ASCII value of space is 32 and that of a is 97 so you are effectively accessing array hash with a negative index.

To solve this you can ignore non-alphabets by doing :

if(! isalpha(str[i]) {
    str[j++] = str[i++]; // copy the char.
    continue;  // ignore rest of the loop.
}

codaddict 2010-02-11 11:31:28

You might need to actually do an explicit check for lower case only as well?

Douglas Leeder 2010-02-11 11:45:08

Answer 2

+2 A:

This is going to break on any space characters (or anything else outside the range 'a'..'z') because you are accessing beyond the bounds of your hash array.

Paul R 2010-02-11 11:31:58

Answer 3

+1 A:

...

// iterate through the input string char by char.
for(i=0,j=0;str[i];)
{
  if (str[i] == ' ')
  {
    str[j++] = str[i++];
    continue;
  }

    // if the char is not hashed.
    if(!hash[str[i] - 'a'])
    {

...

tur1ng 2010-02-11 11:33:38

If there is a non-alphabet other than space, say like a '?', this will fail again.

gameover 2010-02-11 11:39:42

@gameover: Good point

tur1ng 2010-02-11 11:45:20

Answer 4

+2 A:

This is code golf, right?

d(s){char*i=s,*o=s;for(;*i;++i)!memchr(s,*i,o-s)?*o++=*i:0;*o=0;}

finnw 2010-02-11 12:01:38

You should ignore spaces.

KennyTM 2010-02-11 12:14:50

You should post that to IOCCC... ;)

tommieb75 2010-02-11 12:56:04

Answer 5

+1 A:

#include <stdio.h>
#include <string.h>

int hash[26] = {0};

static int in_valid_range (char c);
static int get_hash_code (char c);

static char * 
remove_repeated_char (char *s)
{
  size_t len = strlen (s);
  size_t i, j = 0;
  for (i = 0; i < len; ++i)
    {
      if (in_valid_range (s[i]))
    {
      int h = get_hash_code (s[i]);
      if (!hash[h])
        {
          s[j++] = s[i];
          hash[h] = 1;
        }
    }
      else
    {
      s[j++] = s[i];
    }
    }
  s[j] = 0;
  return s;
}

int
main (int argc, char **argv)
{
  printf ("%s\n", remove_repeated_char (argv[1]));
  return 0;
}

static int 
in_valid_range (char c)
{
  return (c >= 'a' && c <= 'z');
}

static int 
get_hash_code (char c)
{
  return (int) (c - 'a');
}

Vijay Mathew 2010-02-11 12:07:52

Two things: in_valid_range() is just islower(), so can be removed for brevity; get_hash_code() (and all treatments of hash codes) should be unsigned, since 'char' might not be, thus (int) (c - 'a') can be negative.

unwind 2010-02-11 12:11:04

Answer 6

+1 A:

char *s;
int i = 0;

for (i = 0; s[i]; i++)
{
    int j;
    int gap = 0;
    for (j = i + 1; s[j]; j++)
    {
        if (gap > 0)
            s[j] = s[j + gap];
        if (!s[j])
            break;
        while (s[i] == s[j])
        {
            s[j] = s[j + gap + 1];
            gap++;
        }
    }
}

Phil Wallach 2010-02-11 12:09:47

Answer 7

+1 A:

void striprepeatedchars(char *str)
{
    int seen[UCHAR_MAX + 1];
    char *c, *n;

    memset(seen, 0, sizeof(seen));

    c = n = str;
    while (*n != '\0') {
        if (!isalpha(*n) || !seen[(unsigned char) *n]) {
            *c = *n;
            seen[(unsigned char) *n]++;
            c++;
        }
        n++;
    }
    *c = '\0';
}

jim 2010-02-11 12:54:25

ansaurus

tags:

views:

answers:

C program to remove repeated char from a string

related questions