Regular Expressions - HP 9000 User Manual

Computers

Hide thumbs Also See for 9000:

Administration manual (386 pages)

Manual (165 pages)

User manual (110 pages)

100

101

102

103

104

105

106

107

108

109

110

111

112

113

114

115

116

117

118

119

120

121

122

123

124

125

126

127

128

129

130

131

132

133

134

135

136

137

138

139

140

141

142

143

144

145

146

147

148

149

150

151

152

153

154

155

156

157

158

159

160

161

162

163

164

165

166

167

168

169

170

171

172

173

174

175

176

177

178

179

180

181

182

183

184

185

186

187

188

189

190

191

192

193

194

195

196

197

198

199

200

201

202

203

204

205

206

207

208

209

210

211

212

213

214

215

216

217

218

219

220

221

222

223

224

225

226

227

228

229

230

231

232

233

234

page of 234

/ 234
Contents
Table of Contents
Bookmarks

Table of Contents

• Multi-byte characters. Finally, character handling

also

involves the correct

parsing of multi-byte character streams and the interpretation of multi-byte

characters. Multi-byte character streams may contain both single-byte and

multi-byte characters. To process this data, each byte must be identified

as either a single-byte character or as part of a multi-byte character. The

details of these and other aspects of character handling are discussed in

Appendix A.

Regular Expressions

HP-UX allows the specification of arbitrary character strings through the use of

regular expressions. For further details on their use, see the section, "Regular

Expressions", in The Ultimate Guide to the vi and ex Text Editors. The syntax

of regular expressions has been extended in HP -UX to allow use with other

character sets.

Here is one example of an internationalized regular expression:

[=e=]]

This matches the word "help" spelled with any variation of the letter "e" (for

example, e,

e, e, e).

The existing syntax of a range expression (e.g., "[a-z]") is not changed.

However, its meaning has been extended to mean "match any collating element

which falls between the two given collating elements based on the current

locale's LC_COLLATE collation sequence."

For multi-byte languages, the support in regular expressions is not as extensive.

For example, multi-byte characters are allowed as single character elements in

expressions, and they can be used in character ranges. However, the inverse of

a range

("[-a .. z]")

is not allowed with multi-byte characters in general. This

is due to restrictions in the way the codesets are implemented. Moreover, some

new features are not allowed with multi-byte codesets simply because they have

no application to Asian languages.

2-12

Introduction to NLS

Table of Contents

Chapters

Table of Contents

Regular Expressions - HP 9000 User Manual

Chapters

Related Manuals for HP 9000

Related Content for HP 9000

Table of Contents