课程 14：字符串 (Strings)

课程目录

1. 字符串基础概念
2. 字符串切片与遍历
3. 字符串方法详解
4. 字符串格式化
5. 字符串不可变性与转换

1. 字符串基础概念

1.1 什么是字符串？

字符串（String）是由字符组成的有序序列，用于表示文本数据。在Python中，字符串是内置的序列类型之一，用单引号 '' 或双引号 "" 括起来。

重要概念：字符串是不可变类型，一旦创建就不能修改其中的字符。任何"修改"操作都会创建新的字符串对象。

1.2 字符串的定义与访问

# 定义字符串的多种方式
s1 = 'hello'         # 单引号
s2 = "world"           # 双引号
s3 = '''多行
字符串'''               # 三引号，支持多行
s4 = str(123)          # 从其他类型转换

# 访问字符（索引从0开始）
print(s1[0])           # h
print(s2[-1])          # d (负索引从末尾开始)
print(s1[1:3])         # el (切片)

1.3 字符串的基本操作

# 基本操作示例
s1 = 'hello'
s2 = 'world'

# 拼接与重复
s = s1 + ' ' + s2
print(s)                    # hello world
print(s1 * 3)              # hellohellohello

# 长度与成员判断
print(len(s))              # 11
print('he' in s1)          # True
print('xyz' not in s1)     # True

# 比较操作
print('abc' < 'def')       # True (按字典序)
print('hello' == 'hello')  # True

2. 字符串切片与遍历

2.1 切片操作详解

切片语法： string[start:end:step]
• start: 起始索引（包含），默认为0
• end: 结束索引（不包含），默认为字符串长度
• step: 步长，默认为1

# 切片操作示例
text = 'abcdefghijklmnop'

# 基本切片
print(text[1:5])           # bcde
print(text[:5])            # abcde (从开头到索引5)
print(text[5:])            # fghijklmnop (从索引5到结尾)
print(text[:])             # abcdefghijklmnop (整个字符串)

# 负索引切片
print(text[-5:])           # lmnop (最后5个字符)
print(text[:-5])           # abcdefghijk (除了最后5个字符)

# 步长切片
print(text[::2])           # acegikmo (每隔一个字符)
print(text[1::2])          # bdfhjlnp (从索引1开始，每隔一个字符)
print(text[::-1])          # ponmlkjihgfedcba (反转字符串)

2.2 字符串遍历方法

# 遍历字符串的多种方法
text = 'Python'

# 方法1：直接遍历字符
for ch in text:
    print(ch, end=' ')
print()  # 换行

# 方法2：通过索引遍历
for i in range(len(text)):
    print(f"{i}: {text[i]}")

# 方法3：使用enumerate
for i, ch in enumerate(text):
    print(f"位置{i}: 字符'{ch}'")

# 方法4：反向遍历
for ch in reversed(text):
    print(ch, end=' ')

3. 字符串方法详解

3.1 查找与替换方法

核心查找方法：
• str.find(sub, start, end)：返回子串首次出现的索引，未找到返回-1
• str.rfind(sub, start, end)：从右侧开始查找
• str.index(sub, start, end)：功能同find，但子串不存在时会报错
• str.rindex(sub, start, end)：从右侧开始查找
• str.replace(old, new, count)：替换子串，count指定替换次数
• str.count(sub, start, end)：统计子串出现的次数

# 查找与替换示例
text = "Hello World Hello Python Hello"

# 查找操作
print(text.find("Hello"))          # 0
print(text.rfind("Hello"))         # 22
print(text.find("Python"))         # 17
print(text.find("Java"))           # -1 (未找到)

# 统计与替换
print(text.count("Hello"))         # 3
print(text.replace("Hello", "Hi")) # Hi World Hi Python Hi
print(text.replace("Hello", "Hi", 2)) # 只替换前2次

3.2 大小写转换方法

大小写转换：
• str.lower()：将所有字符转为小写
• str.upper()：将所有字符转为大写
• str.title()：将每个单词的首字母转为大写
• str.capitalize()：仅将字符串首字母转为大写
• str.swapcase()：大小写互换

# 大小写转换示例
s = "hello WORLD python"

print(s.lower())      # hello world python
print(s.upper())      # HELLO WORLD PYTHON
print(s.title())      # Hello World Python
print(s.capitalize()) # Hello world python
print(s.swapcase())   # HELLO world PYTHON

3.3 去除空白与填充方法

空白处理与填充：
• str.strip(chars)：去除字符串两端的指定字符，默认去除空格
• str.lstrip(chars) / str.rstrip(chars)：分别去除左侧/右侧
• str.center(width, fillchar)：将字符串居中
• str.ljust(width, fillchar) / str.rjust(width, fillchar)：左对齐/右对齐
• str.zfill(width)：用0填充到指定宽度

# 空白处理与填充示例
s = "  hello world  "
num = "42"

print(s.strip())           # hello world
print(s.lstrip())          # hello world  
print(s.rstrip())          #   hello world

print(s.center(20, '*'))   # ***hello world****
print(s.ljust(20, '-'))    # hello world-------
print(s.rjust(20, '+'))    # +++++++hello world

print(num.zfill(5))        # 00042

3.4 分割与连接方法

分割与连接：
• str.split(sep, maxsplit)：按sep分割字符串为列表
• str.rsplit(sep, maxsplit)：从右侧开始分割
• str.splitlines(keepends)：按换行符分割
• str.join(iterable)：将可迭代对象用当前字符串连接
• str.partition(sep)：将字符串分为三部分：分隔符前、分隔符、分隔符后

# 分割与连接示例
text = "apple,banana,orange,grape"
multiline = "line1\nline2\nline3"

# 分割操作
fruits = text.split(',')
print(fruits)              # ['apple', 'banana', 'orange', 'grape']

fruits2 = text.split(',', 2)  # 最多分割2次
print(fruits2)             # ['apple', 'banana', 'orange,grape']

lines = multiline.splitlines()
print(lines)               # ['line1', 'line2', 'line3']

# 连接操作
result = '-'.join(fruits)
print(result)              # apple-banana-orange-grape

# 分割为三部分
before, sep, after = text.partition(',')
print(f"前: {before}, 分隔符: {sep}, 后: {after}")

3.5 判断与检查方法

字符串检查方法：
• str.startswith(prefix)：判断是否以prefix开头
• str.endswith(suffix)：判断是否以suffix结尾
• str.isalpha()：判断是否全为字母
• str.isdigit()：判断是否全为数字
• str.isalnum()：判断是否仅包含字母和数字
• str.isspace()：判断是否全为空白字符
• str.islower() / str.isupper()：判断是否全为小写/大写

# 字符串检查示例
s1 = "Hello123"
s2 = "12345"
s3 = "   "
s4 = "hello"
s5 = "WORLD"

print(s1.isalnum())        # True
print(s2.isdigit())        # True
print(s3.isspace())        # True
print(s1.startswith("He")) # True
print(s1.endswith("23"))   # True
print(s4.islower())        # True
print(s5.isupper())        # True

4. 字符串格式化

4.1 三种格式化方式详解

格式化方式对比：
• % 格式化："Hello, %s" % name (Python 2风格)
• str.format() 方法："{} {}".format("Hello", "World") (Python 3风格)
• f-字符串：f"I'm {age} years old" (Python 3.6+，推荐)

# 字符串格式化示例
name = "Alice"
age = 18
city = "Beijing"
height = 1.65

# 1. % 格式化
print("Hello, %s" % name)
print("I'm %d years old" % age)
print("I live in %s and I'm %.2f meters tall" % (city, height))

# 2. format() 方法
print("Hello, {}".format(name))
print("I'm {} years old from {}".format(age, city))
print("Height: {:.2f}m".format(height))

# 3. f-字符串（推荐）
print(f"Hello, {name}")
print(f"I'm {age} years old from {city}")
print(f"Next year I'll be {age + 1} years old")
print(f"Height: {height:.2f}m")
print(f"Name: {name.upper()}, Age: {age}")

4.2 格式化规范详解

# 格式化规范示例
name = "Bob"
age = 25
salary = 50000.12345

# 数字格式化
print(f"Age: {age:05d}")           # Age: 00025 (5位，不足补0)
print(f"Salary: ${salary:,.2f}")   # Salary: $50,000.12 (千分位分隔符)
print(f"Percentage: {0.1234:.1%}") # Percentage: 12.3%

# 字符串对齐
print(f"Name: {name:>10}")         # Name:        Bob (右对齐)
print(f"Name: {name:<10}")         # Name: Bob       (左对齐)
print(f"Name: {name:^10}")         # Name:   Bob    (居中)

# 条件表达式
status = "active" if age >= 18 else "inactive"
print(f"Status: {status}")

5. 字符串不可变性与转换

5.1 不可变性理解

重要概念： 字符串是不可变类型，一旦创建就不能修改。任何"修改"操作都会创建新的字符串对象。

# 不可变性示例
s = "hello"

# 这些操作会报错：
# s[0] = 'H'  # TypeError: 'str' object does not support item assignment

# 正确的"修改"方式：
s = 'H' + s[1:]           # 创建新字符串
print(s)                   # Hello

# 或者使用字符串方法
s = s.replace('h', 'H')   # 创建新字符串
print(s)                   # Hello

5.2 类型转换方法

# 字符串与其他类型转换
# 其他类型转字符串
num = 123
lst = [1, 2, 3]
print(str(num))            # "123"
print(str(lst))            # "[1, 2, 3]"

# 字符串转其他类型
s1 = "123"
s2 = "3.14"
s3 = "hello"

print(int(s1))             # 123
print(float(s2))           # 3.14
print(list(s3))            # ['h', 'e', 'l', 'l', 'o']

# 安全转换（处理异常）
def safe_int(s):
    try:
        return int(s)
    except ValueError:
        return None

print(safe_int("123"))     # 123
print(safe_int("abc"))     # None

学习建议：字符串是Python中处理文本数据的基础工具。建议多动手实践，从简单的字符串操作开始，逐步掌握高级技巧。记住，理解字符串的不可变性是关键，掌握常用方法能大大提高编程效率。

思考题：如何设计一个高效的字符串匹配算法？在什么情况下使用内置的字符串方法，什么情况下需要自己实现算法？这个问题的解决思路是什么？