Working with HTML Document in Ruby
Ruby is very versatile programming language when it comes to HTML. This language was designed for the web in mind and as such it has a lot of methods and classes to offer for dealing with HTML pages.
One common example often time used when dealing with HTML is URL extraction. Ruby requires you to include “uri” class before you can proceed
include “uri”
text =”My http://www.site.com site on the web”
URI.extract(text) //extracts URL from the string text
URI.extract(text, [‘http’,’https’]) //extracts http and https urls from the text
URI.extract relies on regular expressions to extract needed part of HTML
Ruby has multiple other gems that allow you to convert text to HTML. One such gem is called RedCLoth. It’s to_html method convert text into HTML.
gem ‘RedCloth’
require ‘redcloth’
text = RedCloth.new%{MY_TEXT_HERE}
puts text.to_html
Ruby allows a web html document conversion into text as well.