CFLib.org – Common Function Library Project

detableize(string)

Last updated August 26, 2005

author

Jared Rypka-Hauer

Version: 1 | Requires: CF6 | Library: StrLib

Description:
Handy for scraping screens. This will strip all table, tr, td, and th tags from a string. It also removes leading whitespace and extra newline characters to eliminate code formatting.

Return Values:
Returns a string.

Example:

<cfset htmlString = "<table>
    <tr>
        <th width=""18"" height=""11"">
            this
        </th>
    </tr>
    <tr>
        <td>
     | | is (test whitespace stripping)
        </td>
    </tr>
    <tr bgcolor=""red"">
        <td>
            some
        </td>
    </tr>
    <tr>
        <td>
            test
        </td>
    </tr>
    <tr>
        <td>
            code
        </td>
    </tr>
</table>">

<cfoutput>
#htmlCodeFormat(htmlString)#
<br><br>
#htmlCodeFormat(detableize(htmlString))#
</cfoutput>

Parameters:

Name Description Required
string String to format. Yes

Full UDF Source:

<!---
 Strips all table and table content tags and extra whitespace from a string.
 
 @param string      String to format. (Required)
 @return Returns a string. 
 @author Jared Rypka-Hauer (jared@web-relevant.com) 
 @version 1, August 25, 2005 
--->
<cffunction name="detableize">
    <cfargument name="string" type="string" required="true" />
    <cfset var outputString = arguments["string"]>
    <cfset outputString = reReplaceNoCase(outputString , "</*table>", "", "all")>
    <cfset outputString = reReplaceNoCase(outputString , "</*t[rhd](\s*\w*=*""*\w*""*)*>", "", "all")>
    <cfset outputString = reReplaceNoCase(outputString , "(?m)^\s*", "", "all")>
    <cfset outputString = reReplaceNoCase(outputString , "\n{2,}", "#chr(10)#", "all")>
    <cfreturn outputString />
</cffunction>

Search CFLib.org


Latest Additions

Raymond Camden added
QueryDeleteRows
November 04, 2017

Leigh added
nullPad
May 11, 2016

Raymond Camden added
stripHTML
May 10, 2016

Kevin Cotton added
date2ExcelDate
May 05, 2016

Raymond Camden added
CapFirst
April 25, 2016

Created by Raymond Camden / Design by Justin Johnson