sql >> Database > >> RDS >> PostgreSQL

Pyspark:UTF-nulteken verwijderen uit pyspark-dataframe

Ah wacht - ik denk dat ik het heb. Als ik zoiets doe, lijkt het te werken:

null = u'\u0000'
new_df = df.withColumn('e', regexp_replace(df['e'], null, ''))

En dan toewijzen aan alle stringkolommen:

string_columns = ['d','e']
new_df = df.select(
  *(regexp_replace(col(c), null, '').alias(c) if c in string_columns else c for
    c in df.columns)
  )

Postgresql -- opschonen HTML-tags in het midden van string Kan .csv niet importeren in Cloud SQL vanuit de console

INNER JOIN Waar clausule
Grote resultaatsets streamen met MySQL 4.1.x + Connector/J
Hoe QUARTER() werkt in MariaDB
mysql NULL-waarde in waar in CLAUSE