diff options
author | Michael Koziarski <michael@koziarski.com> | 2006-10-03 23:45:32 +0000 |
---|---|---|
committer | Michael Koziarski <michael@koziarski.com> | 2006-10-03 23:45:32 +0000 |
commit | f238d495b70a264abdb864fe8107e02766b285b4 (patch) | |
tree | cfe1f5df118b46d1426cfc87326c26c8fbe63a85 /activesupport/lib/active_support/core_ext/string | |
parent | 8cb0079feabe011b7edd1c65114efdb7047a02ec (diff) | |
download | rails-f238d495b70a264abdb864fe8107e02766b285b4.tar.gz rails-f238d495b70a264abdb864fe8107e02766b285b4.tar.bz2 rails-f238d495b70a264abdb864fe8107e02766b285b4.zip |
Add ActiveSupport::Multibyte. Provides String#chars which lets you deal with strings as a sequence of chars, not of bytes. Closes #6242 [Julian Tarkhanov, Manfred Stienstra & Jan Behrens]
git-svn-id: http://svn-commit.rubyonrails.org/rails/trunk@5223 5ecf4fe2-1ee6-0310-87b1-e25e094e27de
Diffstat (limited to 'activesupport/lib/active_support/core_ext/string')
-rw-r--r-- | activesupport/lib/active_support/core_ext/string/unicode.rb | 42 |
1 files changed, 42 insertions, 0 deletions
diff --git a/activesupport/lib/active_support/core_ext/string/unicode.rb b/activesupport/lib/active_support/core_ext/string/unicode.rb new file mode 100644 index 0000000000..0e4c76fb86 --- /dev/null +++ b/activesupport/lib/active_support/core_ext/string/unicode.rb @@ -0,0 +1,42 @@ +module ActiveSupport #:nodoc: + module CoreExtensions #:nodoc: + module String #:nodoc: + # Define methods for handeling unicode data. + module Unicode + # +chars+ is a Unicode safe proxy for string methods. It creates and returns an instance of the + # ActiveSupport::Multibyte::Chars class which encapsulates the original string. A Unicode safe version of all + # the String methods are defined on this proxy class. Undefined methods are forwarded to String, so all of the + # string overrides can also be called through the +chars+ proxy. + # + # name = 'Claus Müller' + # name.reverse #=> "rell??M sualC" + # name.length #=> 13 + # + # name.chars.reverse.to_s #=> "rellüM sualC" + # name.chars.length #=> 12 + # + # + # All the methods on the chars proxy which normally return a string will return a Chars object. This allows + # method chaining on the result of any of these methods. + # + # name.chars.reverse.length #=> 12 + # + # The Char object tries to be as interchangeable with String objects as possible: sorting and comparing between + # String and Char work like expected. The bang! methods change the internal string representation in the Chars + # object. Interoperability problems can be resolved easily with a +to_s+ call. + # + # For more information about the methods defined on the Chars proxy see ActiveSupport::Multibyte::Chars and + # ActiveSupport::Multibyte::Handlers::UTF8Handler + def chars + ActiveSupport::Multibyte::Chars.new(self) + end + + # Returns true if the string has UTF-8 semantics (a String used for purely byte resources is unlikely to have + # them), returns false otherwise. + def is_utf8? + ActiveSupport::Multibyte::Handlers::UTF8Handler.consumes?(self) + end + end + end + end +end
\ No newline at end of file |