CFLib.org – Common Function Library Project

detableize(string)

Last updated August 25, 2005

Author: Jared Rypka-Hauer

Version: 1 | Requires: CF6 | Library: StrLib

Description:
Handy for screen scraping. Strips all table, tr, th, and td tags from a string, then removes leading whitespace from each line and collapses repeated newline characters, so the indentation and line breaks left over from the markup's code formatting are eliminated.

Return Values:
Returns a string.

Example:

<cfset htmlString = "<table>
	<tr>
		<th width=""18"" height=""11"">
			this
		</th>
	</tr>
	<tr>
		<td>
	 | | is (test whitespace stripping)
		</td>
	</tr>
	<tr bgcolor=""red"">
		<td>
			some
		</td>
	</tr>
	<tr>
		<td>
			test
		</td>
	</tr>
	<tr>
		<td>
			code
		</td>
	</tr>
</table>">

<cfoutput>
#htmlCodeFormat(htmlString)#
<br><br>
#htmlCodeFormat(detableize(htmlString))#
</cfoutput>
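
Since the description mentions screen scraping, here is a minimal sketch of that use. It assumes detableize() is already defined or included on the page. The URL is a placeholder, not a real data source, and cfhttp.fileContent is read directly so the snippet stays within CF6 syntax.

```cfml
<!--- Hypothetical URL: substitute the page you actually want to scrape. --->
<cfhttp url="http://www.example.com/report.html" method="get" timeout="10" />

<!--- cfhttp.fileContent holds the response body as a string. --->
<cfset scrapedText = detableize(cfhttp.fileContent)>

<cfoutput>
#htmlCodeFormat(scrapedText)#
</cfoutput>
```

Only the table, tr, th, and td tags are removed, so any other markup in the fetched page is still present in scrapedText and will be shown escaped by htmlCodeFormat().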

Parameters:

Name      Description          Required
string    String to format.    Yes

Full UDF Source:

<!---
 Strips all table and table content tags and extra whitespace from a string.
 
 @param string 	 String to format. (Required)
 @return Returns a string. 
 @author Jared Rypka-Hauer (jared@web-relevant.com) 
 @version 1, August 25, 2005 
--->
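<cffunction name="detableize" returntype="string" output="false">
	<cfargument name="string" type="string" required="true" />
	<cfset var outputString = arguments["string"]>
	<!--- Remove opening and closing table tags, with or without attributes. --->
	<cfset outputString = reReplaceNoCase(outputString, "</?table(\s[^>]*)?>", "", "all")>
	<!--- Remove tr, th, and td tags, with or without attributes. --->
	<cfset outputString = reReplaceNoCase(outputString, "</?t[rhd](\s[^>]*)?>", "", "all")>
	<!--- Strip leading whitespace from every line. --->
	<cfset outputString = reReplaceNoCase(outputString, "(?m)^\s*", "", "all")>
	<!--- Collapse runs of newlines into a single newline. --->
	<cfset outputString = reReplaceNoCase(outputString, "\n{2,}", chr(10), "all")>
	<cfreturn outputString />
</cffunction>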
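
To see what each stage of the function contributes, the following sketch applies the four replacements one at a time to a small sample and prints every intermediate result. The variable names and the sample string are made up for illustration, and the patterns are this sketch's own simplified stand-ins, so they may differ from those inside the function.

```cfml
<!--- Sample built with chr() so the whitespace is explicit. --->
<cfset nl = chr(10)>
<cfset tab = chr(9)>
<cfset sample = "<table>" & nl & tab & "<tr>" & nl & tab & tab & "<td>one</td>" & nl & tab & "</tr>" & nl & "</table>">

<!--- Stage 1: drop opening and closing table tags. --->
<cfset afterTable = reReplaceNoCase(sample, "</?table(\s[^>]*)?>", "", "all")>
<!--- Stage 2: drop tr, th, and td tags, with or without attributes. --->
<cfset afterCells = reReplaceNoCase(afterTable, "</?t[rhd](\s[^>]*)?>", "", "all")>
<!--- Stage 3: strip whitespace at the start of every line. --->
<cfset afterIndent = reReplaceNoCase(afterCells, "(?m)^\s*", "", "all")>
<!--- Stage 4: collapse runs of newlines into a single newline. --->
<cfset afterNewlines = reReplaceNoCase(afterIndent, "\n{2,}", nl, "all")>

<!--- Print each stage by looking its variable up by name. --->
<cfoutput>
<cfloop list="sample,afterTable,afterCells,afterIndent,afterNewlines" index="stage">
<strong>#stage#</strong>
#htmlCodeFormat(variables[stage])#
</cfloop>
</cfoutput>
```

Because \s also matches newline characters, the third stage already absorbs most blank lines, which leaves the final stage as a cleanup pass for whatever remains.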