ansaurus

Question

ruby 1.9: how do I get a byte-index-based slice of a String?

Answer 1

+1 A:

You can do this too: s.bytes.to_a[ix...pos].join(""), but that looks even more esoteric to me.

If you're calling the line several times, a nicer way to do it could be this:

class String
  def byteslice(*args)
    self.dup.force_encoding("ASCII-8BIT").slice(*args).force_encoding("UTF-8")
  end
end

s.byteslice(ix...pos)

dvyjones 2009-12-14 14:30:49

This is just stashing away the same code. I was wondering if there is not indeed a char slicer in ruby19.

kch 2009-12-14 17:11:58

Sorry, but you seem to be a bit ambigual(?) here, do you want a char slicer or a byte slicer? As per ruby 1.9.1, there is no byte slicer without a bit of hacking. I personally like the first code in my answer the best, but that's up to you to choose.

dvyjones 2009-12-14 17:50:53

I guess there isn't. Sometimes you have to add your own special use case to the Ruby standard library, and that's why the whole idea of open classes was invented.

Ken Bloom 2009-12-14 17:50:57

Oops, I meant byte slicer.

kch 2009-12-15 04:54:09

As foro your first solution, it reads prettier, and I'm generally all for that, but this particular line runs a million times, so I'll go with the one that performs best. And that's also why I'm particularly interested in ruby having its own internal implementation, because then it'd probably be in C in the base string code.

kch 2009-12-15 04:57:18

Answer 2

+1 A:

Doesn't String#bytes do what you want? It returns an enumerator to the bytes in a string (as numbers, since they might not be valid characters, as you pointed out)

str.bytes.to_a.slice(...)

Marc-André Lafortune 2009-12-14 15:05:27

But I still need a substring, not an array of characters. It seems your suggestion would only make the operation a lot more expansive with going from enum to array and then making a utf8 string out of that again.

kch 2009-12-14 17:12:50

ansaurus

tags:

views:

answers:

ruby 1.9: how do I get a byte-index-based slice of a String?

related questions